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Abstract. Cost register automata (CRA) were proposed by Alur et 
all as an alternative model for weighted automata. In hope of finding 
decidable subclasses of CRA, they proposed to restrict their model with 
the copyless restriction but nothing is really know about the structure 
or properties of this new computational model called copy less CRA. 

In this paper we study the properties and expressiveness of copyless CRA. 
We propose a normal form for copyless CRA and we study the properties 
of a special group of registers (called stable registers). Furthermore, we 
find that copyless CRA do not have good closure properties since we show 
that they are not closed under reverse operation. Finally, we propose a 
subclass of copyless CRA and we show that this subclass is closed under 
regular- lookahead. 


1 Introduction 

Weighted automata are an extension of finite state automata for computing 
functions over words [5] . The main idea is that every transition of the automaton 
is mapped to an element of a fixed semi-ring. A run of the automaton on a word 
is then mapped to the product of its transitions and the value of an automaton 
on a word is the sum over all runs. By choosing a suitable semiring such as the 
min-plus semiring this allows to compute the minimal costs of a run over a word 
or similar quantitative properties of word languages. Weighted automata have 
been extensibly studied since Schiitzenberger [14] and their expressiveness [4,9], 
decidability [10, 1[, extensions [4], and applications [11, 3[ are very well known. 

Recently, Alur et all [2] introduce the computational model of cost register 
automata (CRA), an alternative model to weighted automata for computing 
functions. The main idea of this model is to enhance deterministic finite au¬ 
tomata with registers that can be combined with semiring operations, but they 
cannot be used for taking decisions during a computation. The output of CRA 
over an input is then given by a hnal function over the registers content. In [2], 
Alur et all shows that CRA are equally expressive than weighted automata, that 
is, they define the same class of functions than weighted automata. 

The main advantage of introducing a new but equivalent model is that it allows 
to study natural subclasses of functions that could not be proposed from the 


classical perspective. This is the case for the class of copyless CRA that where 
proposed in [2] in hope of finding decidable subclasses of weighted automata. The 
idea of the so-called copyless restriction is to use each register at most once in 
every transition. Intuitively, the automaton model is register-deterministic in the 
sense that it cannot copy the content of each register similar than a deterministic 
finite automaton that cannot make a copy of its current state. Despite that this 
is a natural and interesting model for computing functions, research on this line 
has not been pursued further and nothing is known until yet. 

In this paper we embark in the first study of the structure and expressiveness 
of copyless CRA. We start this research by proposing a normal form for copyless 
CRA that, given any total order on the set of registers, one can construct an 
equivalent copyless CRA such that during transitions the content of each register 
is transfered to lower registers with respect to the total order. The normal form 
of a copyless CRA shows how the registers content flows from higher to lower 
registers and how registers are used as “aggregators”. This normal form motivates 
to divide the set of registers into two types: stable and iioii-stable registers. 
Non-stable registers store and operate its content like any register but they are 
unstable in the sense that at some point on a run they pass its content to lower 
registers forgetting what they have computed so far. In contrast, stable registers 
never pass their content to lower registers and they are always receiving and 
aggregating the content of non-stable registers. We show that from any state in 
a copyless CRA in normal form, one can force the automaton to read a word 
and “reset” the content of non-stable registers. This implies that during a run 
the output of a function depends directly from the content of stable registers 
and non-stable registers are just acting as temporary reservoirs. Finally, we also 
study the behavior of stable register during an automaton cycle. We show that 
when an automaton cycle is repeated, stable registers are updated as (pseudo) 
geometric series. 

We use the previous results to study the expressiveness of copy less CRA. In 
particular, we show that copyless CRA are not closed under reverse operation, 
namely, there exists a function / defined by a copyless CRA whose reverse func¬ 
tion /” (i.e. /”(w) = /(re”) where w” is the reverse word of w) is not definable 
by any copyless CRA. This unpleasant property of copyless CRA shows that 
the computational model is asymmetric with respect to the orientation of the 
input. This implies that copyless CRA do not enjoy good closure properties and, 
moreover, a potential logical characterization of the class of function defined by 
copyless CRA seems unlikely. 

The non-expressibility result of copyless CRA explained above motivates the 
introduction of subclass of copyless CRA that has good closure properties. This 
subclass is called bounded alternation copyless CRA (BAC-CRA). We show 
that the bounded alternation restriction of copyless-CRA imposed good closure 
properties. In particular, we show that BAC-CRA are closed under regular look¬ 
ahead, a desirable property for any one way computational model (i.e. it is 
symmetric with respect to the future). The proof and construction that shows 
that BAC-CRA is closed under regular look-ahead is non-trivial and it requires 


the introduction of an internal data structure in order to keep parallel runs with 
finite memory and copy less registers. 

The paper is organized as follows: in Section 2 and 3 we give the background 
and dehnitions of copy less CRA with some examples. In Section 4 we show 
our structural results regarding copyless CRA like its normal form and stable 
registers. Then in Section 5 we study the expressiveness of copyless CRA and 
show that they are not closed under reverse. In Section 6 we introduce the class 
of bounded alternation copyless CRA and show that they are close under regular 
look-ahead. 

2 Preliminaries 

In this section, we summarise the notation and definitions used for regular lan¬ 
guages, finite automata, and weighted automata. 

Finite automata. A finite automaton [7] over E* is a tuple A = {Q, E, S, qq, F) 
where Q is a finite set of states, SqQxExQ is a. finite transition relation, qo 
is the initial state and F is the final set of states. A run p of A is a sequence of 
transitions of the form: 

where (pi,ai+i,Pi+i) e 6 for every i < n. We say that p (like above) is a run of A 
over a word w = ai... an if po = 9o- Furthermore, we say that p is an accepting 
run if p„ 6 F. A word w is accepted by A if there exists an accepting run of A 
over w. We denote L{A) the language of all words accepted by A. 

A finite automaton A is called deterministic if 5 is a function 5 : Q x E ^ Q. 
When A is deterministic, we extend the function S from letters to words and 
we denote this extension hy S* : Q x E* Q, i.e., 6*{q, e) = q and S*{q, w ■ a) = 
S{6*{q,w),a). 

Semirings and functions. A semiring is a structure S = (S', ©,©,0,1) where 
(S, ©,©) is a commutative monoid, (S, ©,1) is a monoid, multiplication dis¬ 
tributes over addition, and 0©s = sO0 = © for each s € S. If the multiplication 
is commutative, we say that S is commutative. In this paper, we always assume 
that S is commutative. For the sake of simplicity, we usually denote the set of 
elements S by the name of the semiring S. As standard examples of semi-rings 
we will consider the semi-ring of natural numbers IN1(+,-) = (N,+,•,0,1), the 
min-plus semi-ring lNloc.(min,+) = (Noo,rain,+, oo,0) and the max-plus semi-ring 
lNl_oo(max,+) = (IN_oo,max,+, -oo,0) which are standard semi-rings in the field 
of weighted automata [-5]. 

We denote by F the class of all functions from E* to S. We say that a function 
f e F is definable by a computational system A (i.e. weighted automata, CRA) 
if f{w) = |A](u>) for any w e E* where [A] is the semantics of A over words. 
For any word w, let w'^ denote the reverse string. We say that a class of functions 
F c is closed under reverse [2] if for every f ^ F there exists a function f^^F 
such that f^(w) = for all w e E*. 


Weighted automata. Fix now a semiring S and a finite alphabet S. A weighted 
automata over S [13,5] is a tuple A = {Q,E,E,I,F) where Q is a finite set of 
states, E : Q X S X Q ^ S is a. weighted transition relation, and I,E : Q ->■ S is 
the initial and final function, respectively. Usually, if E(p,a,q) = s, we denote 
this transition graphically by p q. A run p of A is a sequence of transitions: 

„ _ „ Wfl „ W2/S2 Wn/Sn 

P - <]0 —*■ 9l —»• ••• —>■ 9n- 

where Wi + 0 for all i <n. We say that p is a run of A over a word w = 0102 ... a„ if 
it also holds that I{qo) + 0. Moreover, a run p like above is accepting if J(go) © 
and E(qn) + 0. In this case, the weight of an accepting run p of A over w is 
defined by |p| = /(go) ® 1^1=1 Si © E{qn). We define Run^(w) as the set of all 
accepting runs of A over w. Finally, the weight of A over a word w is defined by 

Ml(u;)= ^ IpI 

peRun^ (it;) 


where the sum is equal to 0 if Run^(?ii) is empty. The set C{A) = {w e E* \ 
Run^(r(;) + 0 } is called the language of A. 

A weighted automaton A is called unambiguous if |Run^(r(;)| < 1 for every 
w 6 E* and is called finitely ambiguous if there exists a uniform bound N such 
that I Run^(w)| < N for every w e E* [15,8]. Furthermore, A is called polynomial 
ambiguous if the function |Run_ 4 (-)[ is bounded by a polynomial on the input 
length [15,8]. For the special case when the number of runs of A are bounded by 
a linear function, we say that A is linear ambiguous. 

Trim assumption. For technical reasons, in this paper we assume our finite 
automata, weighted automata or cost register automata (see Section 3) are al¬ 
ways trim, namely, all its states are reachable from some initial states (i.e., they 
are accessible) and they can reach some final states (i.e., they are co-accessible). 
It is worth noticing that, since verifying if a state is accessible or co-accessible 
reduced to checking reachability in the transition graph [12] and this can be 
done in NLOGSPACE, then we can assume without lost of generality that all 
our automata are trimmed. 


3 Copyless cost register automata 

In this paper we study the expressibility of copyless cost register automata. 
Cost register automata (CRA) were introduced in [2] as an alternative model 
for computing functions over words. In contrast with weighted automata, CRA 
is deterministic model based on register. An advantage of CRA with respect 
to weighted automata is that it allows a fine grained understanding on how 
functions are computed. For example, the copyless-restriction (defined below) 
constrains a CRA to only use a register once per transition. This kind of restric¬ 
tions admit a better understanding of subclasses of function that are included 
in the hierarchy of weighted automata but they cannot be naturally defined. 


In this section, we recall the definitions of cost register automata and the 
copyless restriction. For this, we need to also give the definitions of expressions 
and substitutions over a semiring that are standard in the area. 

Variables, expressions, and substitutions. Fix a semiring S = (S', ©, ©, ©, 1) 
and a set of variables X disjoint from S. We denote by Expr(d:’) the set of all 
syntactical expressions that can be defined from X, constants in S, and the 
syntactical signature of S. An expression in Expr(A’) is basically a formula in 
S where variables in X are missing values. Eor any expression e e Expr(A’) we 
denote by Var(e) the set of variables in e. We call an expression e € Expr(A’) 
without variables (i.e. Var(e) = 0) a ground expression. For any ground expres¬ 
sion we define [e] € S to be the evaluation of e with respect to S. 

A substitution over X is defined as a mapping cr : A -> Expr(A). We denote 
the set of all substitutions over X by Subs(A). A ground substitution cr is a 
substitution where each expression a{x) is ground for each x e X. Any substi¬ 
tution can be extended to a mapping a : Expr(A) ->• Expr(A) such that, for 
every e e Expr(A), (T(e) is the resulting expression e[a] of substituting each 
X € Var(e) by the expression a(x). For example, if a{x) := 2x and a{y) = 3y, 
and e = X + y, then (T(e) = 2x + 3y. By using the extension ct, we can define 
the composition substitution cti o a 2 of two substitution cti and CT 2 such that 
tTi o < 72 ( 2 ;) = d'i(a 2 (x)) for each x e X. 

A valuation is defined as a substitution of the form 1 /: X ->■ S. We denote the 
set of all valuations over X by Val(A). Clearly, any valuation ly composed with 
a assignment a defined an expression without variables that can be evaluated as 
liy o a{x)J for any x e X. 

In this paper, we say that two expressions ei and 62 are equal (denoted by 
ei = 62) if they are equal up to evaluation equivalence, that is, \v o ei] = [1^0 62] 
for every valuation v € Val(A). Similarly, we say that two substitutions ai and 
tT 2 are equal (denoted by cti = 172 ) if < 71 ( 7 :) = a 2 {x) for every x e X. 

Copy less restriction. We say that an expression e e Expr(A) is eopyless if for 
all X € A, a{x) uses every variable from A at most once. For example, x- (y + z) 
is copyless but x - y + x ■ z is not copyless (i.e. x is mentioned twice). Notice that 
the copyless restriction is a syntactical constraint over expressions. Furthermore, 
we say that a substitution a is eopyless if (7(x) is a copy less expression for every 
X € A and Var((7(x)) n Var((7(?/)) = 0 for every different pair x,y e X. Similar 
than for expressions, eopyless substitutions are restricted in such a way that each 
variable is used at most once over the whole substitution. 

The following technical result will be used in Section 5 and the reader can 
avoid the proof for a first read. It shows the form of a eopyless expression when 
it is rewrite as a sum of monomials. 

Lemma 1. For any eopyless expression e, there exist an eguivalent expression 
e' of the form: 

e' = 0 {c^ O x] 

2=1 \ X€Xi / 


where Xi,...,is a sequence of different sets over X and ci,... ,Ck is a se¬ 
quence of values over S for k >0. 

Proof. The lemma is shown by induction on the size of e. For the base case, 
when e is equal to a constant or a variable, the lemma trivially holds by taking 
e' = e. For the inductive case, suppose that e = ei ® 62 where ® is either ffi or 
0 . By the inductive hypothesis we know that there exist expressions e'l and 62 
equivalent to ei and 62 , respectively, such that for j e { 1 , 2 }: 

e'j = ®\cl Q X 
»=1 { ^ 

where is a sequence of different sets over X and c{,...,c{. is a 

sequence of values over S for kj > 0. Given that e is a copyless expression, then 
we know that Var(ei) n Var(e 2 ). Without lost of generality, we can assume that 
Xil £ Var(ej) for j € {1,2} and i < kj. If not and there exists X^ f Var{ej), then 
this implies that cl must be equal to 0 (i.e. X^ is not contributing in e') and we 
can omit in e}. Then since X^ £ Var(ej) and Var(ei) n Var(e 2 ), this implies 
that: 

Xl^ n Xf^ = 0 for every ii < ki and 12 < fe. ( 1 ) 

Now we consider when ® is either ffi or 0 . If e = ei © 62 , then by considering 
e' = ei ffi 62 the lemma is proved given that by ( 1 ) all sets of the form X^ are 
different for j e { 1 ,2} and i < kj. Otherwise, e = ei 0 62 and we get: 

e'l © 62 = 0 (c{ O a;) 0 0 (c- O x) 

= 00 (c!i © 0 x) = e' 

21 = 1 22 = 1 UX^ 

lo 


The last derivation holds by (1), i.e., we do not need to consider repetitions in 
the multiplication of two monomials since X}^ n Xf^ = 0. One can also check 
that all sets of the form Xf^ u Xf^ are different. Indeed, all sets Xf^ are pairwise 
different and the same holds for X^^. This means by (1) that all sets Xl^ u Xf^ 
must be different as well. Then e' is equivalent to e and it has the form stated 
in the lemma. □ 

Cost register automata. A cost register automaton (CRA) over a semiring 
S is a tuple A = (Q, 27, ^,(5, go,r'Oi/i-) where Q is a set of states, 27 is the input 
alphabet, A is a set of variables, S ■ Q x E ^ Q x Subs(A) is the transition 
function, go is the initial state, j/q ■ A" ->• S is the initial valuation, and p : 
Q ->• Expr(A) is the final output function. A configuration of A is a tuple 



(g, I/) where q ^ Q and ly 6 Val(d:’) represents the current values in the variables 
of A. Given a string w = ai...a„ € S*^ the run of A over w is a sequence of 
configurations: {go, i^o) ( 911 *^ 1 )-^ ... Vn) such that, for every 1 < * < n, 

S{qi-i,ai) = {qi,ai) and Vi{x) = \vi-i oCTi(x)] for each x ^ X. The output of A 
over w, denoted by [^](w), is [j4(p(gn))]- 

A CRA A is called copyless if for any transition 6{qi,a) = {q 2 ,(j) and for any 
state (j € (5, it holds that cr is a copyless substitution and is a copyless 
expression. In other words, every time that registers from A are operated, they 
can be use just once. In the following, we give some examples of copyless CRA 
and compare their expressiveness with weighted automata. 

Example 1. Let S be the max-plus semiring [l\l_oo(max,- 1 -) and E = {a,b}- Con¬ 
sider the function /i that given a word ic e A* it computes the longest substring 
of &’s. This can be easily defined by the following CRA Ai with two register x 
and y. 



Ai stores in the a;-register the length of the last suffix of b’s and in the y-register 
the length of the longest substring of 5’s seen so far. After reading a 6-letter 
Ai updates x by one (i.e. the 6-suffix has increased by one) and it keeps y 
unchanged. Furthermore, after reading a a-letter it resets x to zero and updates 
y by comparing the substring of 6’s that has just finished (i.e. previous x-content) 
with the length of the longest substring of 6’s (i.e. previous y-content) that has 
been seen so far. Finally, it outputs the maximum between x and y. 

One can easily check that the previous CRA satisfies the copyless restriction 
and, therefore, it is a copyless CRA. Indeed, each substitution is copyless and 
the final output expression max{x,y} is copyless as well. 

In [2] it was shown that copyless CRA contains the class of all unambiguous 
weighted automata (UWA), a robust subclass of weighted automata that has 
many characterizations and good decidability properties [8,9]. One can easily 
show that the class of copyless CRA contains also the class of finitely ambigu¬ 
ous weighted automata (FAWA) [15,8], a subclass that is more expressive than 
UWA [8] and still has good decidability properties [6] . Interestingly, the previous 
example shows that copyless CRA strictly contains the class of FAWA. Indeed, 
it was shown in [8] that /i is a function that is not definable by any FAWA. This 
means that copyless CRA are strictly more expressive than FAWA. 

What functions are definable by copyless CRA? The previous example sug- 
gestz that copyless CRA could contain the class of linear ambiguous weighted au- 




tomata given that /i is definable by a weighted automata whose non-determinism 
is linearly bounded with respect to the size of the input. Unfortunately, this does 
not hold in general as we will show in Section 5: there exists linear ambiguous 
weighted automata that are not definable by any copy less CRA. 

The previous discussion suggests the following question: are copyless CRA con¬ 
tained in the class of linearly ambiguous weighted automata? The next example 
shows that the answer to this question is negative since there exists functions 
definable by copyless CRA that are not definable by linear ambiguous weighted 
automata. 


Example 2. Again, let S be the max-plus semiring IN_oo(max, +) and S = {a, b, 4f\ 
Consider the function /2 such that, for any w € E* of the form 
with Wi e {a,b}*, it computes the maximum number of a’s or 6’s for each sub¬ 
string Wi (i.e. max{|wi|o, |wi|f,}) and then it sum these values over all substrings 
Wi, that is, f 2 {w) = T,t=o maxUrcija, |wi|f,}. One can check that the copyless CRA 
A 2 define below computes f 2 '- 


# 


x,y,z: 


x,y.= Q 

z '■= z + max{a:, y} 


X ’■= X +1 



In the above diagram of A 2 , we omit an assignment if a register is not updated 
(i.e. it keeps its previous value). For example, for the a-transition we omit the 
assignments y '■= y and z '■= z for the sake of presentation of the CRA. Similarly, 
we also omit the assignment x ■= x and z ■= z for the 6-transition. 

The copyless CRA A 2 follows similar ideas than Ai: the x- and y-register 
counts the number of a’s and 6’s, respectively, of the longest suffix without ^ 
and z-register stores the partial output without considering the last suffix of a’s 
and 6’s. When the last substring Wi over {a, 6} finishes (i.e. it comes a ^-letter 
or the input ends), then A 2 sums the maximum number of a’s or 6’s in Wi to z 
(i.e. z := z -I- maxja::, y}). 


Function /2 is a counterexample for showing that copyless CRA are contained 
in the class of linear ambiguous weighted automata. In [8] it was shown that /2 
is definable by a weighted automata but it is not definable by any polynomial 
weighted ambiguous automata. Interestingly, the previous example shows that 
copyless CRA can defined /2 and, therefore, they can defined functions that are 
beyond the scope of polynomial ambiguous automata. 




4 Structural properties of copyless CRA 


In this section we develope some machinary that will be useful in Section 5 
when we show the non-expressibility of copyless CRA. In Section 4.1 we show 
that copyless CRA can be assumed to do not contain the 0-constants in any 
transition, initial valuation or final funtion. Then, in Section 4.2, we propose a 
normal form for copy less CRA and, in Section 4.3, we highlight some special 
subset of registers (called stable registers) that are fundamental to understand 
the run of a copyless CRA. Finally, in Section 4.4 we study the asymptotic 
growing of the register content of a copyless CRA when a cycle is iterated. 

Most of the results in this section are technical and the reader can skip most of 
the proofs for a first read. However, we would like to highlight the normal form 
of copyless CRA (Section 4.2) and the results on stable registers (Section 4.3) 
which are of independent interest from the non-expressibility result. 


4.1 Removing zeros from CRA 

We say that an expression e e Expr(A’) is reduced if e = 0 or the ©-constant is not 
mentioned inside e. It is straightforward to show that for any expression e there 
exists an equivalent expression e* that is reduced. Indeed, one can construct 
inductively an equivalent expression by using the following reductions: e ffi 0 = e 
and e 0 0 = 0. Then by reducing each subexpression recursively, the resulting 
expression is either 0 or do not use the ©-constant at all. Further, note that if e 
is copyless, then its reduced expression e* is copyless as well. 

Let / : E* ^ S be a function definable by a copyless CRA. We say that / is 
a non-zero function if f{w) + 0 for all w € E*. The following result shows that, 
without lost of generality, we can assume that all constants in a copyless CRA 
are different from © whenever the function defined by A is a non-zero function. 

Proposition 1. Let A be a copyless CRA such that [A] is a non-zero func¬ 
tion. Then there exists a copyless CRA A' such that its initialization function, 
substitutions and final output functions are reduced and different from ©. 

Proof In this proof, we show how to avoid keeping a © value in registers that 
where forced to be © (i.e. forced by a substitution a such that a{x) = © for 
some register x). The idea is to store in the state which registers are equal to 
©. Let A = {Q, E, X, 5, qo, vo,pi) be a copyless CRA. Define a new copyless CRA 
A! = {Q',E,X,6,qQ,VQ,p.') such that: 

— Q' = Q X 2*^ is the new set of states, 

— Qg = (go, ^o) where Sq is the set of registers such that a: e S' iff = ©, and 

— Aq is the same as vq for registers in A \ So and for registers x € Sq we define 
;zo(r) = 1 (or any other constant from S). 


For the definition of 5' and /r' we need to first introduce some notation. Let 
S £ X and e be an expression over X. We define the expression e[iS'] that is a 
reduced expression d* where d is the result of taking e and replacing all registers 
a; € S' n Var(e) by 0. 

Now, we are ready to define 6' and p'. For every {q, S) € Q' and ae E,we define 
6'{{q,S),a) = {{q 'where S{q,a) = {q',a), S' is the set of all registers x 
such that cr(a;)[S] is equal to 0, and a' is defined for every x € X a,s follows: 


cr'(a;) 


J (T(a;)[S] if X S 

11 if X € S 


Finally, we define the output function u' such that, for every (q, S) € O', it holds 
that p'((q,S)) = /r(q)[S]. 

It is straightforward to show that A and A define the same function and that 
t'o. S' and /i' do not use the ©-constant. Note that there cannot exist a reachable 
state (q, S) e Q' with fJ,'{{q, S)) = 0, otherwise the output of the function defined 
by A will be 0 for an input in E* which contradict the fact that |^] is a non-zero 
function. □ 


4.2 The normal form of a copyless CRA 

In this subsection, we define a normal form for copyless CRA given a linear order 
on registers. Let A = {Q,E,X,S,qo,vo,fi) be a copyless CRA and let < be any 
predefined linear order over X. We say that A is in normal form with respect 
to < if for every transition of the form S{p,a) = (q,cr), it holds that x <y for all 
y e Var(cr(x)). In other words, all the variables mentioned in (t(x) are greater 
or equal than x with respect to <. For example, the copyless CRA in Example 1 
is in normal form with respect to the order y < x. 

The idea behind this normal form is to prevent unexpected behaviors on copy¬ 
less CRA. For example, consider the following copy less CRA that is not in normal 
form with respect to the order x < y: 



The initial state in the above copyless CRA is qi and the initial valuation vq is 
equal to r'o(x) = I'oiy) = 0. Here, registers x and y count the number of a’s and 
b’s, respectively. However, depending on the state both registers can have either 
the number of a’s or the number of b’s. It is clear that one would like to avoid 








this type of behaviour since this cyclic information can be easily stored in finite 
memory. Intuitively, one register should always contain the number of a’s and 
the other register the number of 6’s. 

In the next result, we show that for every copyless CRA there always exists an 
equivalent copyless CRA in normal form. The idea behind the proof is to permute 
the substitutions for registers and keep in the state the reverse permutation to 
encode the substitutions. Comming back to our running example, the normal 
form of this copyless CRA is the following: 


X '■= X 

y:=y + l 


a 


X '■= X + I 

y-=y 


X '■= X 

y:=y + l 



a 


X ■= X + 1 

y-=y 


From now on we assume that copyless cost-register automata are given with 
a linear order on the registers so we can order the registers, i.e., write X = 
{xi, . .. ,Xn} with the order defined by Xi < Xj iS i < j. The next proposition 
shows that every copyless CRA can be transformed into an equivalent copyless 
CRA in normal form with respect to <. 

Proposition 2. For every copyless CRA A there exists a copyless CRA in nor¬ 
mal form A' such that they recognize the same function. Moreover A! and A 
have the same set of registers and the same linear order on them. The number 
of states in A' can be bounded exponentially in the size of the automaton A. 

Proof. Let A = {Q,S,XA,qoXo,y) be a copyless CRA. We define a copyless 
CRA A' in normal form such that A' computes the same function as A. The 
idea of A' is to store a state qe Q and a permutation p of the set X such that, 
if {q,v) is the current conhguration of A over an input w, then ((q, is the 

configuration of A' over w and v{x) = iy'(p(x)). In other words, the content in v 
is still in A but the value n{x) of x is now in the register p{x) for every x e X. 
The permutation of registers’ content will allow us to keep the normal form in 

A'. 

Formally, let A' = {Q',S,X,6',qoXoA') where: 

Q' = Q X {p\ p is a. permutation of the set X} 

is the set of states, = (goAd) is the initial state where id is the identity 
permutation, and Vq = vq is the initial function. For the sake of presentation, 
let us show how the run of A' will correspond to the run of A before going 
into the definition of the transition function 5' and the output function p'. For 
an expression e € Expr(A’) and a permutation p over X, we define p{e) to be 





the expression e where the registers are replaced according to p. Let {q, v) and 
{q'jiy') be the configuration of A and A', respectively, after reading w € E*. 
Then we will show that: 

q = (q,p) for some permutation p and v{x) = A(p(x)) for every x e X. (2) 

Note that, for the word e, this correspondence holds since {qo, i^o) and ((qo) id), Aq) 
are the initial configuration of A and A', respectively, and zyg(id(a:)) = Vq{x) = 
vq{x). Furthermore, if we define p'{{q,p)) = p{p{q)) and show that (2) always 
holds, we will prove that A and A! computes the same function. Indeed, if the 
transition function b' preserves (2), then 

v'{li'{{q,p)y) = v'{p{p{q))) (by definition of p') 

= v{p{q)) (by Property (2) 

This will prove that the outputs of A and A' are the same. Therefore, in the 
rest of the proof we will show how to define 5' such that A' is a copyless CRA 
in normal form and its definition satisfies (2). 

Before defining <5', we need some additional definitions. For a copy less sub¬ 
stitution a and a permutation p both over X, define the set = {x € X \ 
Var(p(cr(x))) i= 0}, that is, the set of all variables x where p{a{x)) is a non¬ 
constant expression. Further, define the set S'^p = {min{Var(p(CT(x)))} | x e 
S'cr,p} and the function ^ : Sa,p ->■ S'^ p such that for all x € Sa,p- 

T°,p(x) = min{Var(p(cr(a:)))} (3) 

One can easily check that ^ is a bijective function from Sa,p to p. It is sur¬ 
jective by the definition of Sa,p and p, and injective by the copyless restriction 
over cr. To see the last claim, recall that any copyless substitution satisfies that 
Var(cr(x)) n Var(tT(?/)) = 0 and, in particular, Var(p(cr(x))) n Var(p(tT(?/))) = 0 
for any permutation p. In other words, this means that: 

T°,p{x) = mm{Var(p(cr(a;)))} + min{Var(p(cr(y)))} = T°p{y) 

for every pair x,y e Sa,p, that is, p is an injective function. Finally, given that 
p is a bijective function from Sa-,p ^ X to S'^. p £ X , we can extend ^ to a 
bijection Ta-,p : X ^ X such that Tcr,p{x) = T^^p{x) for every x e Sa,p- Of course, 
there might be many alternatives for extending ^ into a bijective function 
but we can choose any of these extensions (i.e. this decision is not important for 
the construction). 

We have now all the ingredients to define the transition function J'. For every 
p € Q, a € A, and permutation p over X, if S(p,a) = (q,cr) then we define 
S'((p,p),a) = {{q,Tc^p),a') such that: 

(j'{x) = p{(j{T-^p{x))) 

for every x ^ X. From the previous definition, we can easily check that a' is a 
copyless substitution. In fact, p o a is a copyless substitution for every copyless 


a (i.e. p is just a renaming of variables) and is just permuting the variables. 
Therefore, we can conclude that a' is copyless as well. 

Our next step is to show that a' is in normal form. Recall that a' is in normal 
form if for every x € X \t holds that x < y ioi every y € Var(CT'(x)). We prove 
this by case analysis by considering whether x ^ S'^ p or not. First suppose that 
X € p. Since x is in the codomain of and is an extension of p, we 
have that x = min{Var(p(cr(r“p(a:))))} by (3). Then if we replace p{<y{T~^p{x))) 
by cr', we get that x = min{Var((T'(a;))}. In particular, we have that x < y for 
every y e Var((T'(a;)). Now suppose that x i S'^ p. This means that x is not in 
the codomain of r^ p which implies that T~^p{x) i Sa,p- In other words, a'(x) = 
p{a{T~^p(x))) is a constant expression and, thus, a; < y for every y e Var(cr'(a;)). 
Since for both cases it holds that x < y ioi every x & X and y e Var(cr'(a;)), then 
we conclude that a' is in normal form. 

For the last part of the proof, we show by induction that 5' satisfies the cor¬ 
respondence (2) between A and A'. Let and {{p,p),iy!^) be the con¬ 

figuration of A and A1, respectively, after reading w e S* and suppose that 
Vn{x) = v'^{p{x)) for every x e X (i.e. inductive hypothesis). Furthermore, sup¬ 
pose that: 


(p,r'„)(g,i^„+i) and {{p, p),v!^) 

are the transitions for A and A’, respectively, after reading a new letter a e E. 
We prove the correspondence (2) between i^n+i and as follows: 


= KA'AcrAA)) 

= KipAAApA^^AA)))) 
= KipiAx))) 

= iyn(cr(x)) 

= iyn+i(x) 


(by definition of A+i) 

(by definition of a') 

(by composing r^^p and r^ ^) 
(by inductive hypothesis) 

(by definition of iy„+i) 


This proves that the transition function S' keeps the correspondence (2) between 
A and A! ■ Since it also holds for the initial configuration then by induction this 
shows that it holds for all configurations which proves that the outputs of A and 
A' are the same. □ 


Let A = {Q,E,X,6,qoj'^OiP) be a copyless CRA. As usual, we define the 
transitive closure of 6 as the function 6* ■ Q x E* Q x Subs(A’) by induction 
over the word-length. Formally, we define 5*{q, e) = (g, id) where id is the identity 
substitution for all g e Q and 5*(gi, w-a) = (gsjCrocr') whenever 6*(qi,w) = (g 2 ,cr) 
and S{q 2 ,a) = {q 3 ,a'). For a CRA A we define the set Subs(A) of all substitutions 
in A as follows: 


Subs(A) = { cr £ Subs(A’) | 3p,q e Q. 3w e E*. S*(p,w) = (q^a) } 


It is easy to check that, if all substitutions in S are copy less, then all substitution 
in Subs(yl) are also copyless. Furthermore, <5* and, in particular, Subs(^) also 
preserves the normal form, that is, for all a € Subs(^) it holds x < y ior all 
y € Var((T(x)). This can be easy proved by induction over the word-length. 
Assume that it holds for all words w of length at most n and let 5* {q, w) = 
Suppose we want to extend w with ae E and S*{q'^a) = {q",a'). By definition, 
we know that 5*{q,w-a) = {q”,aoa'). Then take ?/e Var((Tocr'(x)). By definition 
there exists a register z such that z € Yai{a'{x)) and y e Var(cr(z)). Since 6 is 
in normal form, we conclude that x < z < y. In other words, x < y for every 
y 6 Var(cr o a'(x)) and, thus, a o a' is in normal form. 


4.3 Stable registers and collapse substitutions 

Let A = (Q,E,A^,SjQo,!^0!M) be a copyless CRA in normal form with respect 
to a fix total order <. In a copy less CRA in normal form, the content of registers 
flows during a run from higher to lower register with respect to <. Unfortunately, 
this does not necessarily mean that the content of all registers will eventually 
reach the <-minimum register. For example, if all substitutions in A are of the 
form cr(x) = x®k for some fc € S, then each register will store just its own content 
during the whole run. Intuitively, in this example each register is “stable” with 
respect to the content flow of A. We formalize this idea with the notion of stable 
registers. Let cr be a copyless substitution in normal form (i.e. for each x e A, 
X <y for all y e Var(cr(x))). We say that a register x is a-stable (or stable on a) 
if X € Var(cr(x)). 

The following lemma shows that the composition preserves stability between 
registers. 

Lemma 2. Let a, a' be two copyless substitution in normal form. For any regis¬ 
ter x e X it holds that x is stable on a and a' if, and only if, x is (a o a')-stable. 

Proof. Suppose that x is stable on a and cr'. This means that x e Var(cr(x)) 
and X € Var(cr'(x)) and, thus, x € Var(cr o cr'(x)) by composition. For the other 
direction, suppose that x € Var(cr o a'). Then we know that there exists y such 
that X € Var(cr( 2 /)) and y € Var(cr'(x)). Since a and a' are in normal form, then 
X <y < X which implies that x = y and, thus, x is stable on a and a' . □ 

We generalize the idea of stable register from substitutions to copyless CRA. 
We say that a register x is stable in A (or just stable if A is understood from 
the context) if x is cr-stable for all transitions a e Subs(A). A register is called 
non-stable if it is not stable. By Lemma 2, it is enough to check that a register 
X is cr-stable for all letter-transitions S{p,a) = (g, cr) to know whether x is stable 
in A or not. 

For the next proposition, we need to recall some standard definitions of di¬ 
rected labeled graphs and bottom strongly connected components. For Q' Q Q 
we say that Q' is a bottom strongly connected component (BSCC) of A if (1) 


for every pair qi,q 2 e Q' there exists w e E* such that d*{qi,w) = {q 2 ,cr) for 
some substitution a and (2) for every q ^ Q' and w e S* if 6*{qi,w) = (< 72 ,c) 
then <72 e Q- Intuitively a BSCC Q' of ^ is a set of mutually reachable states 
such that there is no word that leaves Q'. We say that A is strongly connected 
if the whole set Q is a BSCC of A. In this section, we assume that A is strongly 
connected. 

By the previous proposition, we know that for every pair of states q, q' there 
exists a word that “collapses” the content of non-stable register into stable 
register. This motivates the following dehnition: a substitution a e Subs(^) is 
said to be collapse substitution if Var((T(a;)) = 0 for all non-stable registers in 
cr. With this definition, the previous lemma saids that for every pair of states 
q,q' there exists a word w®’* such that 6{q,w‘^'‘^ ) = {q',(j) and cr is a collapse 
substitution. From now on, ii q = q', we write w* and a'^ instead of w*’* and cr*’*, 
respectively. 

We say that a substitution a e Subs(^) is a collapse substitution in A if 
Var(cr(a;)) = 0 for all non-stable registers in a. Intuitively, collapse substitution 
makes all non-stable registers to forget its content. In this sense, non-stable 
registers are unstable and they can loose its current content with a collapse 
substitution. Notice that by definition stable register can never loose its current 
content. 

Proposition 3. Let A = {Q,E,X,StQot^oA) ® copyless and strongly con¬ 
nected CRA. Then for all q,q' e Q there exists a word and a substitution 
cr*’* such that (1) ^*(< 7 , u;*’* ) = (( 7 ',cr*’* ), (2) re*’* contains each letter in E, 
and (3) ct*’* is a collapse substitution in A. 

Proof. Let X = {xi,... ,Xn} he the registers of X in increasing order with respect 
to < and ( 7 , ( 7 ' two states in Q. We construct the word ru*’* by (inverse) induction 
starting from Xn and ending in xi. Specifically, for every i < n we will define 
a word re*’* and a substitution ct*’* such that the proposition holds for all 
non-stable register greater than Xi. Clearly the proposition will be shown by 
considering re*’* = re*’* . 

We start by the base case i = n and consider whether x is stable or not. If is 
stable, then take a word u and a substitution cr such that 5*{q, u) = {q', cr) and u 
contains each letter in E. Given that A is strongly connected we know that u and 
cr always exists. Then by defining re*’* = u and ct*’* = cr, the proposition holds 
for the stable register x„. Now, suppose that a;„ is non-stable which means that 
there exist a pair p,p' e Q and a word u such that rr) = (p', ct) and is non¬ 
stable in cr. Given that A is in normal form and is the maximum register with 
respect to <, this implies that Var(cr(a;„)) = 0. Pick two words ei,e 2 e E* such 
that S*{q,vi) = (p,(Ti) and S*(p',V 2 ) = (q',a 2 ) for some substitutions cti and 172 , 
and ei contains each letter in E. Again, we know that ei and V 2 always exists 
since A is strongly connected. Then define re*’* = vi-u-V 2 and cr*’* = ai ■ a ■ 0 - 2 . 
By construction, we know that ^*(( 7 , ru*’* ) = (( 7 ',cr*’* ) and w*’* contains all 
letter in E. For proving that Var(CT*’* (a;„)) = 0 , notice that x is non-stable on 



a. By Lemma 2 this implies that x is non-stable on ai o a o a 2 - Given that A is 
in normal form and Xn is the maximum register, we get that Var(tT(a;„)) = 0. 

For the inductive step, we suppose that and exist and show how 

to construct and that satisfy the proposition for registers greater or 
equal than Xi. Again, we consider whether Xi is stable or not. If Xi is stable, 
then by defining and the proposition trivially holds. 

Indeed, property (I) and (2) are satisfied by the inductive hypothesis and (3) 
is satisfied since it holds for every register greater than Xi (again by inductive 
hypothesis) and also for Xi given that Xi is stable. Thus, the interesting case is 
when Xi is non-stable. Suppose that Xi is non-stable and, therefore, there exist 
a pair p,p' e Q and a word u such that 5*{p,u) = and Xi is non-stable 

on cr. Let vi,V 2 e S* such that 5*{q,vi) = (p,ai) and 6*(p',vi) = (q',a 2 ) for 
some substitutions cti and (72 • Recall that vi and V 2 exist by assuming that A is 
strongly connected. Now, define: 

Q.q' q.q 

•Vi-U‘V2 

q-,q' q,q 

(Ji = o ai O cr O (72 

It is clear by construction that 6*{q, ) = {q\ ) and contains all letter 

in S. The last fact holds because we know (by induction) that contains 
all letters in E. To conclude the proof, we must show that Yar{af‘^ (x)) = 0 
for every non-stable register x > Xi. Let x be any non-stable register x > Xi 
(possibly Xi) and let a* = ai o a o (j 2 - First, note that all y e Var(cr*(x)) are 
non-stable. Otherwise, if y e Var(cr*(x)) is stable, then y € Var(cr*(y)) but 
this is impossible since Var((T*(x)) n Var((T*(y)) = 0 by the definition of being 
copyless. Therefore, we have that every register in Var(CT*(x)) is non-stable. 
Note also that Xi i Var(tT*(x)). This is true when x + Xi (i.e. A is in normal 
form) and also true when x = Xi since we know that Xi ^ Var(CT*(xi)) (i.e. Xi is 
non-stable). Then we have that all registers in Var(tT*(x)) are non-stable and 
strictly greater than Xi. This means that the inductive hyphotesis holds for i + \ 
and Yax{(jf_^^{y)) = 0 for all y € Var(CT*(x)). By composing and a*, we 
conclude that Vai(af’^(x)) = 0. This concludes the proof. □ 


4.4 Growing rate of stable registers in a loop 

Fix a copy less and non-zero CRA A = {Q, 6, qo , 1 ^ 0 , y)- Similar that in the 

previous section, assume also that A is in normal form with respect to the total 
order < and, furthermore, assume that A is strongly connected. We start this 
section with a trivial fact about copyless expression that will be useful during 
this section. 

Lemma 3. Let a,T e Subs(A) where a is a collapse substitution and let x be a 
a-stable and T-stable register. Then the expression a o t(x) is equivalent to an 
expression of the form (c O cr{x)) ffi d, where c, d € S and c # 0. 


Proof. We prove this by induction on the length of expression r(x). For the base 
step take t{x) = x. Then (jot{x) = <j{x) is equivalent to (l©CT(a;)) ©0. Suppose 
we can write such an expression (cQ x) ® d for t of length n. By the inductive 
assumption when t is of length n + 1 then a o t{x) can be written in the form 
{{cQa{x))®d)®a{f) or {{cQa{x))®d)Qa-{f). If / is a constant, then cr(/) © 

by the non-zero assumption. Otherwise, f = y for some variable in X - {y}. This 
means that y is non-stable in A and then a{f) = f for some constant /' i= ©. In 
either case we have that a{f) = f for some constant /' 0 and, thus, cr o t{x) 

can be rewritten as (c © cr(x)) © (d © /') and (((/' © c) © cr(x)) © (/' © d)), 
respectively. □ 

The goal of this section is to study the behavior of loops in A. A loop over 
a state g € Q in A is a word w € S* such that S*{q,w) = for some 

substitution a. The iteration of any loop is also a loop, that is, w” is a loop over 
q and 6*{q,w'^) = (q,a^) for any n. This motivates the study of the behavior 
of a when it is iterated a big number of times. As the next lemma shows for 
a copyless CRA A in normal form the substitution in A are converted into a 
collapse substitution when they are iterated a fix number of times. 

Lemma 4. Let a e Subs(A) be any substitution. Then there exists A^ > 0 such 
that is a collapse substitution in A. 

Proof. Suppose x is non-stable over a, i.e. x ^ Var(cr(a;)). Since A is copyless and 
in normal form then x < min(Var(CT(x))). But since min(Var(CT(x))) e Var(cr(a;)) 
and A is copyless then min(Var(CT(x))) ^ Var(CT(min(Var((T(a;))))) and thus 
min(Var((7(a;))) / min(Var(cr^(a;))). Applying this argument consecutively we 
get a increasing sequence of registers: 

x < min(Var((7(a;))) < min(Var(CT^(a;))) < min(Var(CT^(x))) < ... 

The number of registers is finite and then this sequence cannot be infinite. Thus 
there exists an N such that Var(cr^(x)) = 0. We conclude that it suffices to take 
N = \X\. □ 

From the previous lemma, we know that a non-stable register x over a become 
a constant when cr is iterated more than lAj-times. In the next proposition, we 
study the growing of stable registers when a collapse substitution is iterated. 
This will be useful to understand the behavior of copyless CRA inside their 
loops. 

Proposition 4. Let a € Subs(A) be a collapsing substitution and x be a a-stable 
register. Then there exist c, d € S with c # 0 .such that for every i > 0 we have: 

cr“^(a;) = (c* © cr(x)) © (d © 0 cd) 

j=o 



Proof. Since cr is a copyless and collapsing substitution from A, then for every 
y 6 Var(tT(a;)) either y = x or Yai(a{y)) = 0. Then consider the expression e 
that is equal to cr(x) where every register y x is replaced with [(t(j/)]. This is 
a copy less expression with only one variable x and A is non-zero. By Lemma 3 
(i.e. by replacing r by ct and a by the identity substitution) it can be rewritten 
in the form e* = (c© x) ffi d for some c,d € S and c + ©. 

We prove this claim by induction using the constants c, d. We start with the 
base step for i = 1 (for i = 0 is trivially true). The expression a o a(x) can 
be rewritten as cr o e* = (c © cr(x)) © d, because it takes the expression tr and 
substitutes x with a{x) and all other registers x with [t(x)]. 

For the inductive step, we have that oe* by the same argument used 

in the previous paragraph. Then by replacing cr*’*'^ with the inductive hypothesis, 
we get: 

a^^^ix) = (c©a“i(x)) ©d 

= (c © ((c* © ct(x)) © (d © 0pQ cd ))) © d 
= © ct(x)) © (d© 0*^1 c^)) © d 

= © ct(x)) © (d© 0)^Q cd ) 

□ 


The previous proposition shows that stable registers grow exponentially with 
respect of the number of times than a loop is iterated. In particular, when S = 
tKl_oo(niax,+) a stable register grow linearly. The next result is a refinement of 
Proposition 4 but in terms of the N_oo(max, +)-semiring. The lemma is technical 
and it will be used in the next section. The reader can avoid the proof for a 
first read. 

For the next lemma, recall that, by Proposition 3, for any q € Q there exists 
a word w’^ such that S*{q,w'^) = {q,cr'^) and cr^ is a collapse substitution over 
every non-stable registers in X. 


Lemma 5. Let A be a copyless and non-zero CRA in normal form over the 
N_oo(max,+) semiring. Furthermore, let q & Q and v + e be a loop such that 
S*{q,v) = {q,T) for some collapsing substitution r. Then for j € N big enough, 
there exists a substitution aj such that 6*{q,w‘^ ■ w'^) = {q,aj) where: 


aj o X{x) 


0(1) if X is non-stable 

max{ j ■ c +a^{x) + 0(1), j-d + 0{l) } otherwise. 


where A € Subs(.A) such that its size does not depend on j; and c, d € N are 
constants that depend on x but not on j or X. 


Proof. From now on we work with the IN_oo(max,+) semiring. Let q ^ Q and 
V t= e he a loop such that d*{q,v) = {q,T) for some collapsing substitution r. 


Recall that is a non-zero CRA and in this semiring 0 = -oo, thus we can 
assume that c > 0. By Proposition 4, we have that for every j > 1 it holds that: 






(x) = (4 Ot(x)) e (do 0c^) 

A;=0 

= maxjj • Cx + t(x), dx + liaxjfc • Ca^}| 
= inax{j-Cj;+ t{x), dx +(j-1)-Cj;} 


(by Prop. 4) 

(by definition of lN_oc.(max,+)) 
(by definition of max) 


= max{j • Cx + t{x) + 0 ( 1 ), j ■ Cx + 0 ( 1 )} (by using the 0 -notation) 

(4) 


Fix any substitution A € Subs(Al). Recall that w‘^ is the word and tr'^ is the sub¬ 
stitution from Proposition 3 for q. Then for any j > 1, consider the substitution 
aj such that S*{q,w‘^ ■ 4’^^ • w‘^) = {q,aj). We show next that aj o A satisfies 
the above properties for any j. For a non-stable register x in A, the result is 
straightforward. Indeed, A is in normal form and, thus, Var(A(x)) contains just 
non-stable registers. This implies that: 


(Tj o A(x) = cr'^ o 4 o cr'^ o A(x) (by definition) 

= cr'J o A(x) (Var(A(x)) contains only non-stable registers) 

= 0(1) (cr'^ and A do not depend on j) 

Suppose now that x is a stable register in A. We need to show that: 

tjj o A(x) = max{ j • c + cr^(x) -H 0(1), J • d-I- 0(1) } (5) 

for j big enough and some constants c, d € IN such that c, d do not depend on A. 
The expression aj o A(x) is equivalent to a‘^ o o cr'^ o A(x). In other words, 
it is equivalent to the expression cr'^ o A(x) where all registers y € Var(cr'^ o A(x)) 
are substituted with a’^ o tA^(j/). We start the proof by analyzing these last 
expressions. 

Let y 6 Var((T'J o A(x)) be any variable. If y is not a r-stable register, then 
Var(r(?/)) = 0 and thus cr'^ o A'^^{y) = 0(1). Otherwise, by (4) we know that 
= max{j • Cy + t(x) -i- 0(1), j ■ Cy + 0(1)} for some Cy € IN and, thus, by 
applying a‘^ on A'''^{y) we get: 

cr'^ o (y) = maxjj • Cy + cr'^ o t(x) -I- 0(1), j • Cy + 0(1)} (6) 

If y is a r-stable register, but non-stable on A (i.e. non-stable in general) then 
tr'J oT(y) is equal to a constant and does not depend on j. Thus we can estimate 
cr? o T(y) by 0 ( 1 ) and then ( 6 ) becomes: 


4o4+4y) = max{j • Cy + 0 ( 1 ), j-Cy-H 0 ( 1 )} 
= J-Cy+ 0 ( 1 ) 


Otherwise, y is a. stable register and, moreover, is a collapsing substitution. 
Then by Lemma 3 we know that o r^y) can be represented as: 


a'^ oT(y) = max{ c +a‘^{y), d' } 

= max{ cr'?(y)+0(1), 0(1) } (7) 

for some constants c',d' 6 IN not depending on j and where c' + -oo. Thus, by 
combining (6) with the previous equation we get: 

(T«oT'^+i(y) = max {j-c^+cr'^ oT(a:) + 0(1), j-Cj^ + 0(1)} (by (6)) 

= max{j • Cy + max{ cr«(y) +0(1), 0(1) } +0(1), }-Cy +0(1)} (by (7)) 

= max {j • Cy + cr'^( 2 /) + 0(1), j • Cy + 0(1), j • Cy + 0(1)} (by distribution) 

= max{j • Cy+cr«(y) +0(1), j-Cy +0(1 )} (8) 

Observe that a variable y e Var((T'J o A(a;)) is table if, and only if, y = x. We 
summary the three possible cases in the next equation: 


(T^ o T^^^{y) 


0 ( 1 ) if y is not r-stable 

- j-Cy + 0 ( 1 ) if y is non-stable 

max{j-cy + cr«(y) + 0 (l), j-Cy + 0 ( 1 )} ii x = y 

(9) 


Now we can prove (5). Recall that the expression Cj o A(a;) is the expression 
cr^ o A(x) where all registers y € Var(cr‘^ o A(a;)) are substituted with tr^ o'r'^’^^(y). 
First notice that by Lemma 3 the expression o A(x) is equivalent to maxjc" + 
a'^{x),d"} = max{cr'^(x) + 0(1),0(1)}. We can do these estimations because A 
does not depend on j. By combining this equation with the definition of aj we 
have: 


aj o A(x) = a'^ o o cr'J o A(a:) 

= tr'^ o o max{ tT'*(a:) + 0 ( 1 ), 0 ( 1 )} 

= = max{ aj(x) + 0(1), 0 ( 1 ) } 

Thus to finish the proof it suffices to show that: 

aj(x) = max{ J • c + tT'^(a:) + 0 ( 1 ), j-d+ 0 (l)} ( 10 ) 

for some constants c,d e N. Indeed, if (10) holds, then aj(x) + 0(1) = max}} • 
c + a'^{x) + 0(1),} -^ + 0(1)}. Similarly the value 0(1) can be estimated by the 
expression j ■ d + 0(1) so the last max operation is not needed. Notice that this 
does not change the constants c and d, which proves that they do not depend 
on A. 

To conclude the proof, we show that (10) holds for a stable register x. Recall 
that aj = a‘^ o o a‘‘. We will show that (10) holds even if we change aj by 



ocr' where cr' is any copyless substitution and x is cr'-stable. The proof is 
by induction on the size of (j'{x). For the base step we must consider ct'(x) = x 
(i.e. X is stable). Then: 

o'j(x) = o o a'(x) 

= a‘^oT^'^^(x) 

= max{ j -Co: +a'^(x)+ 0(1), j -Co: + 0(1) } (by (8)). 

Suppose that (10) holds for expressions a' of length n and we want to show that 
(10) for an expression cr''(x) of the form a-"(x) = <t'(x) ®f where @ is either + or 
max, X is cr'-stable and f is a expression where all their registers are non-stable 
(recall that cr" is copyless). By unraveling cr'^ o o a"(x), we get that: 

cr? o o cr"(a:) = o o (a (x) ® f) 

= (cr® o o cr'(a:)) ® (cr® o o f) 

= max{ j • c+cr®(a:) + 0(1), j • d + 0(1) } @ (cr® o o/) (by induction) 

On the last expression, it is easy to check that cr® o ° f is equal to a constant 
or to j ■ cf + 0(1) for some constant c/ + -oo. Indeed, by (9) we know that 
cr® o T'^'^^(y) = j ■ Cy + 0(1) for non-stable registers. Since / is an expression over 
non-stable register, one can show by induction over the size of / that of 

is either a constant or j -Cf + 0(1) for some constant c/ + -oo and j big enough. 

We start with the case when a'^or^^^of e N. Then cr®or^'^^o/ e 0(1) and for @ 
equal to + or max we have cr®oT-'^^ oct"(x) = max{ j-c+a'^(x)+0(l), j-d+0(l) } 
for j big enough. Suppose now that ct ® o ° f = j • Cf + 0(1). Then we must 
consider when @ is + or max operation. For the former we have: 

CT® o o ct"(x) = max{ J • c + cr®(a:) + 0(1), d • j + 0(1) } + j ■ Cf + 0(1) 

= max{ j ■ (c + cf) +cr®(x) + 0(1), j ■ (d + Cf) + 0(1) }, 
and for the latter we have: 

cr® o o a”(x) = max{ max{ j ■ c + cr®(x) + 0(1), j ■ d + 0(1) }, j ■ Cf + 0(1) } 

= max{ j-c+cr®(x) +0(1), j ■d + 0(l), j-Cf + 0(l) }. 

= max{ j ■ c + cr®(x) + 0(1), j ■ max{d, c/} + 0(1) }. 

The number of induction steps depends on the size of the expression cr"(x) which 
for a"(x) = cr®(x) does not depend on j. This concludes the proof. □ 

5 Non-expressibility of copy less CRA 

In this section, we show two facts about the expressiveness of copyless CRA: (1) 
we show that the class of linear ambiguous weighted automata is not contained 


in the class of copyless CRA and (2) copyless CRA are not closed under reverse. 
We derive this two conclusions by showing a function / that is definable by 
a copyless CRA but its reverse (i.e. f^{w) = f{w^)) is not definable by any 
copyless CRA over the same semiring. Interestingly, /’' is definable by a linear 
ambiguous weighted automata and, therefore, from this we also conclude (1). 

Consider the function /e given by the following copyless CRA B over S = 
{a, 6,#} and N_oo(max,+): 


a; := a; + 1 

y-=y 


x,y: 



To understand the output of B, let us first define /g formally. For any word 
w e S* let k he the number of ^ symbols in w. Furthermore, let rii and rrii be 
the number of a’s and 6’s, respectively, between the i-th and i + 1 occurrence of 
^ in w for 0 <i < k. Additionally, let no, mo, n/c, be the numbers of a’s and 
b’s before and after the first and last ^ in w. Then by the dehnition of B, one 
can easily check that /g is given by the following definition: 

r k 

/b(w) = maxi rrij + ^ n^ | j e {0,... fc} 

^ \ i=i+l 

for w + e and = 0. From this definition, one can also give a formal definition 

of /g (i.e. the reverse function of /g) which is given by reversing the roll of n^ 
and rrij in the definition of /g. Formally, one can easily check that the definition 
of /g is defined as: 


fsiw) = max I ^ rii + mj | j € {0 ,... k} 

^ { i=0 

In Theorem 1 below we show that is not definable by any copyless CRA. 
Intuitively, a copyless CRA A that computes will have to store the number 
of a’s in a register Xa and compare Xa with the last sequence of &’s between 
^-letters. Since a copyless CRA can used the content of a register just once per 
transition, then A will loose the content of Xa and forget the number of a’s seen 
so far. The previous argument is formalized in the next result. 

Theorem 1. The function /g is not recognizable by any copyless CRA. 





Proof. Suppose there exists a copyless CRA A = which 

computes the function /g. Since the function /g is non-zero (i.e. f§{w) + -oo 
for every w 6 S*) then by Proposition 1 we can assume that the transitions, 
initial and final functions in A do not use -oo. Without lost of generality, we 
can also assume that A is a strongly connected automaton (see Section 4.3). 
If not, we can always change our analysis by adding a word wq from the initial 
state to a BSCC and then construct a counterexample word from there. Formally, 
let (5*(go,w^o) = (9oWo) where gg is a state inside a BSCC Q' £ Q of A. We can 
always redefine A as follows: qg is the new initial state, and the new initialization 
function zzq is defined as Agi^x) = 1 ^ 0 ° o'o(a:). It is straightforward to check that 
for every word w the new automaton constructed from A will return the output 
value of A over wg-w. Thus, for the rest of the proof we assume that A is strongly 
connected and, therefore, all the results from Section 4 hold. 

We start the proof by analyzing the behavior of A on words containing loops of 
just one letter. Since Q is strongly connected, then there exists a state Qa, a word 
Va = with Ua > 0 and a substitution such that S*{qa,Va) = {qa,Ta). We can 
assume by Lemma 4 that is a collapse substitution, that is, Var(Ta(x)) = 0 
for all non-stable registers x in t^. In addition, define a sequence of words: 


Wa{j) = • w'*" 


such that 5*{qa,Wa{j)) = (Qo) CTa) some substitution cr^ (i.e. cr^ depends on j). 
Recall that is the word defined in Proposition 3 for the state qa- By Lemma 5 
we know that there exist constants c“ and such that for j big enough: 




0(1) if a; is non-stable 

max{ j - 0 ^+ cr'^“(x) - 1 - 0(1), j ■d^ + 0(1) } otherwise. 

( 11 ) 


Analogously, by the definition of A we can find a state qb in A, a word vt = b'^’’ 
for some Ub > 0 and a collapsing substitution Tf, such that 6*(qb.,Vb) = (qb^Xb). 
Then similar to the sequence Wa{j), we can define the sequence of words Wb{j) = 
w<ii> such that S*(qa,Wb(j)) = {qb,cri) for a substitution that satisfies 

(11) but for new constants and d^. 

The next step is to understand the growth of stable registers when we repeat 
Waif) s-times or, more important, when we compose {<JaY- For any j > 1 and 
s > 1, define the sequence of words: 


Wais,j) = 


and cr*’^ = (cr'^“ o o cr'Ja Lg^ x be a stable register on A. For s = 2 one can 

easily check the following calculations: 


(T^’-^ (x) = cr'^“ o o cr'^“ o o (t'^“ (x) 

= a'^'^ o o cr{(x) 

= cr«“ o (max{ j • (x) + 0 ( 1 ), j ■< + 0 ( 1 ) }) 


(by definition) 

(by ( 11 )) 


max{ j ^ . u 0 (1) } 

max{ j-c^+max{ j • + cr«“(x) + 0 ( 1 ), + 0 ( 1 ) } + 0 ( 1 ), j-d^ + 0 ( 1 ) } (by ( 11 )) 

max{ 2 j.cS + a^“(x)+ 2 - 0 ( 1 ), j • c> j • d> 2 • 0 ( 1 ), j-d> 0 (l)} 




(by dominance) 


In general, for any natural number s > 1 if x is a stable register it is straightfor¬ 
ward to check that: 



We are ready to define the word for which we prove that A give the wrong 


output. Let Wo be a word such that S*{qo,wo) = {qa,<Jo). Let also and 


yjqh,qo. i^g ^jj^g “collapsing words” between qa and q^ from Proposition 3. For any 
_) > 0 dehne the sequence of words: 

W{s,j) = Wo • Wo(s, j) • Wb(j^) • • Wa(j) 

It is straightforward to check that for any s,j >0 the word w{s,j) has a run 
that goes to qa with a, then it cycles in qa with Wa(s,j), next it goes from qa 
to qb with cycles again times in qb with Wb{j'^), then again back from 

qb to qa with rc'Jt.'jo finally it cycles j-times with the word Wa(j)- In other 
words, w(s,j) behaves as expected by cycling and jumping between qa and qb- 
In the sequel, we show that A outputs wrong value over w{s,j) when we fix x 
and we take j big enough. 

First we estimate the correct value f§'{w{s,j)) for any s,j > 0. By Proposi¬ 
tion 3 we can assume that Wa{s,j), Wa{j) and Wb{j) contain the letter Thus 
the loops of a’s and b’s in the words Wa{s,j), Wa{j), and Wb{j) are all separated 
by The only fragments of w(s,j) that depend on j are these loops. From the 
definition of these loops, one can easily check that the number of a’s in Wa{j) 
is j -ria + 0{1) and the number of &’s in Wbij"^) is • Ub + 0(1). Furthermore, 
similar to Wa{j), the number of a’s in Wa{s,j) is equal to s-j'n-a +0(1). Recall 
that /^(w(s,j)) = maxtjXilo Wi + m*}, where rii and rrii are the numbers of 
a’s and &’s, respectively, between the ^-letters and t is the number of maximal 
subsequences without It is easy to see that: 


/ b ( w ^( s > j )) = rib- f +na- s-j+ 0{s) 


(13) 


where 0{s) represents a fix number of a’s (i.e. do not depend on j) that are 
presented in wg, w‘^°-, or . Notice that the last suffix Wa{j) of a’s is not 
contributing in the sum. This happen because the sequence Wb{j'^) is overshad¬ 
owing the last sequence Wa{j), i.e., the max-operator consider the number of &’s 
in instead of the number of a’s in Wa{j)- The rest of the proof is to show 

that A does not output the right value on ui(s, j). The reason of this misbehav¬ 
ior of A with respect to is because if A is summing the sequence of a’s 

before Wb{P) then it must also sum the sequence of a’s after Wb{j^) which, by 
the previous calculations, we know that this should not happen. 

We estimate now the values in the registers of A after reading the word 
w{s,j) for any j big enough. By the construction of w{s,j) we know that 
6*(qg^w^sA)) = {Qa,<^w{s,j)) and, thus, the current value in the registers of 
A after reading ui(s,j) it is given by the assignment: 


S,j Qa,Qh i Qh-,Qi 

1^0 ° CTwisj) =vo°crgo a A ° cr ° cr 


o tr 


3 

a 


For all non-stable registers y this expression is equivalent to < 7 ^( 2 /) (by Lemma 5) 
which is equal to a constant 0(1) that does not depend on j. For a stable register 
X, the story is much more complicated. We evaluate the expression aiu(s,j)(x) 
step-by-step to estimate this value. First, by (11) we get that: 


ai(x) = max{ j-c“-t-cr'?“(a:)-HO(l), j-<-HO(l) } 


Then by composing o with we get: 


o O CT„(x) 


al 0 0-9'”'?“ O (max{ j ■ + a‘>Ax) + 0(1), j +0(1) }) 

max{ j-cl+al o o cr?" (a:) + 0(1), j +0(1) } 

max{ j -c^ +max{ f ■ cl + cr9i(a:) +0(1), f ■ dl +0(1) } + 0(1), j-d^ +0(1) } 
max{ J ■ d+f ■ cl + a9‘(x) + 0(1), j ■ cl+f ■ dl + 0(1), j • d> 0(1)} 


The next-to-last substitution holds by Lemma 5 and by considering A = tT9'”9“ o 

•2 

cr^o. The next step is to compose o cr9“’9'> with o q denote 


this composition by cr 

^ W{S,J) 

K'ilj) = ° ° i^b ° ° ^ii^)) 

= oct 9“'9‘omax{ j-cl+f-4 + a’^^{x)+0{l), 

j-c:+f-dt + 0{l), j-d:+0{l)} 

= max{ j-cl+f -cl + cr^’-^ ocr'?“’«'’ ocr‘?'>(a;) + 0 ( 1 ), 
j.c>/.d^ + 0(l), j-< + 0(l)} 

= max{ j-cl+f ■cl + Taax{ s • j • c“ + cr'?“ (a:) + 0(s), (s - 1) • j • c“ + j • < + 0(s) } + 0(l), 
j.c>/.d? + 0(l), j-dl + 0{l)] 


= max{ 


= max{ 


j-Ca+3 


J-Ca+J 


■cl + s-j-cl+ a«“(a;) + 0(s), j ■ cl+f-cl + {s-1) ■ j ■ cl+j ■ d^ + 0(s), 
•d^ + 0(l), j-dl + 0{l)] 


(s + 1). j . cl + (x) + 0(s), j • (a • c> + f ' cl + 0(s), 

j-c^+/-< + 0(l), j-< + 0(l)} 


Similar than the previous calculations, the third substitution holds by Lemma 5 
and by considering A = o cr'^'’. Finally, we compose with i/q o ao{x), 

Given that vq o ao(x) is a constant with respect to j, that is vq o (Tq(x) = 0(1) 
then we have that vq o cruj(s j) over a stable register a; in ^ is equal to: 

t'o ° (^w{s,j){x) = max{ (s + 1) - j-c^ +j^ - Cb +0(s), j-{s-cl + dl) + f ■ cl +0(s), 

j-cl+f-dl + 0{l), j-dl + 0{l)} (14) 

For the rest of the proof, we fix a value for s that does not depend on j (i.e. the 
exact value for s will be defined towards the end of the proof). We study now 
the behavior of the function [yl](w(s, j)) for j big enough and we start with the 
form of vq oCTu,(s j)(x) for a stable register x. For a fix s we denote the quadratic 
functions on j by: 

5i(j) = c^/ + c^(s + l)•J+0(l) 
gUj) = c^/ + (c^s + dD•J + o(l) 
gUd) = dl-f + cl-j + o{i) 
glU) = d^J+0(l) 


where 0(1) denotes some constant that do not depend on j (it probably depends 
on s but recall that s is now fixed). Then for every stable register x we have 
t'o o <Jw(s,j){x) = xaaxf^^{gl{j)}. Notice that if dl < cl then for j big enough it 


holds that g^ij) < 9i{j) and g^ij) is dominated by gf{j). This motivates the 
definition of when and 0 otherwise. Furthermore, redefine 

< 7 f(j) by: 

9Uj) = c^/ + (c^(s + l) + eD•J+0(l) 

By the previous discussion, it is easy to see that vq o iJ)} 

for the new definition of g^ij)- 

We study now the output function [^]('u;(s, j)) = z/q ° <yw{s,j) ° dida)- By 
Lemma 1, we know that the copy less expression fJ.{qa) can be presented as an 
expression of the form: 


9{9a) 


max 

2=1 



E- 

x^Xi 


where Xi,..., Xk is a sequence of different sets over X and is a se¬ 

quence of values over IN for fc > 0. Recall that by the previous analysis we know 
that vq o (Jiu(sj)(x) is equal to a constant whenever x is non-stable or equal 
to ma.xf^^{gf{j)} whenever a: is a stable register for j big enough. Then by 
composing i/q o a^(^s,j) with g{qa) we get: 


Ml('w(s,j)) = V0°a^(^s,j)° Kda) 


= 1 ^ 0 ° ° max + E } 

xsXi 

= max {li+ Y, ^0° <^w{s,j){x) } 

xsXi 

= max { m, -I- ^ max{ (j) } } 
2=1 2=1 


(15) 


x^Yi 


where W,..., Y/j is a new sequence of subsets of stable registers and mi ,..., mh is 
a sequence of values over IN for fc > 0. The last equality holds since vq ocr^(^s,j)ix) 
is constant for non-stable registers and we can regroup the sets Xi and sum the 
constants li to form new sets of stable registers Yi and new constants m^. 

The next step is to further simplify the output [^](i(;(s, j)). For this, note that 
|^](r(;(s, j)) is the sum and maximization of quadratic functions over j. Then for 
j big enough there exists a constant i* <h and a partition oiYi* = ZiaZ 2 taZ’ 3 i±iZ 4 
such that: 


M](u;(s,j)) =m..+ E gUj)+ E 52 0)+ E 530) + E 5l0) 

X€Zi X^Z'2 X€Zz X€Z4 

In the previous claim, i* is the maximum argument where (15) is maximize and 
the partition Yi* = a Z 2 a Z 3 tt) Z 4 is the division of Yi* where each function gf 
dominates for j big enough. Thus, |.A](ui(s, j)) is a sum of quadratic functions 
and by summing common j-terms we can reduce [^](i(;(s, j)) to a polynomial 
of the form A- Y + B ■ j + C. Intuitively the value A ■ p should correspond to the 
number of 6 ’s in w{s,j) and B-j to the number of a’s in w{s,j). In the last part 


of this proof, we analyze the i?-coefficient and compare it with the corresponding 
coefficient in 

Recall from (13) that the output of over w{s,j) is equal to rifj^+na-s- 
j + 0{s). This suggests that if the output j)) is correct then we should 

have that B = Ua- s. By adding the linear coefficients of gf{j) for i e {1,2,3,4} 
and X € Zi we get that: 

B = E <-(s + l) + E (Ca-(S + 1) + e^) + E Ca + E < 

X^Zi X€Z2 X^Zs X^Zi 

= (s + l). E < + Z < + Z + Z 

X€ZiUZ2 X€Z2 X^Z^ X€Z4 


Given that B should be equal to Ua ■ s then this implies that Da;€ZiuZ 2 must 
be strictly less than Ua (i.e. ExeZiuZ 2 < ^a)- Otherwise, B will be too big with 
respect to Ua- s. This implies that ExeZiuZ 2 *^0 - - 1 and then: 

(s + 1)- E ^ (s + l)-(na-l) 

X€ZiUZ2 


By replacing this fact in the definition of B, we can over-approximate the B- 
coefficient as follows: 

B < (s + l).(n,-l) + y eS + y + y dS 

X^Z2 X€Zz X^Z4 

< s-na-s + na-l + E(ea+Ca+0- (16) 


For the last bound, recall that all constants e“, and are non-negative and, 
thus, T,xix{^a + <^0 d'a) is greater or equal than the sum of a subset of X. It 
is important to notice that inequality (16) is independent on how we choose s, 
namely, for any value of s we will have that B must satisfy (16). 

We are ready to show that, by suitably choosing a fix value for s, B will not 
be equal to Ua-s. Formally, choose s > Uq -1 + + + This with (16) 

implies that B <na-s which is a contradiction. In other words, A is not counting 
all the a’s in w{s,j) for j big enough. This proves that A does not output the 
right function. □ 


In the previous proof we have shown that fg is not definable by any copyless 
CRA. Interestingly, the reverse of this function fig is definable by a copyless CRA 
as it was shown at the beginning of this section. From this, we can conclude that 
the class of copyless CRA is not closed under reverse. 

It is also interesting that fg is definable by a linear ambiguous weighted au¬ 
tomata. To see this, consider the following weighted automata A' 


a / 1 a / 0 

&,#/0 6/1 a , 6,#/0 



In the previous diagram, the initial and final states are marked with an incoming 
and outgoing arrow, respectively, and they start and finish with value 0. States 
that are not marked start or finish with -oo. One can easily check that the 
weighted automaton M computes the function /g and, moreover, is linear 
ambiguous. This shows that there exist functions that are definable by linear 
ambiguous weighted automata, but they cannot be defined by a copyless CRA. 

We summarize the conclusions of Theorem 1 in the next corollary. 

Corollary 1. 1. The class of linear ambiguous weighted automata is not con¬ 

tained in the class of copyless CRA. 

2. The class of functions recognizable by copyless register automata is not closed 
under reverse. 

6 Bounded alternation copyless CRA 

We define the alternation of an expression e e Exprg(A’) as maximum number of 
shifts between © and 0 in all branches of the parse tree of expression e. Formally, 
for any expression e we define /®(e) = 1 if e = ei 0 62 , /©(e) = 1 if e = ei © 62 
and /(e) = 0 otherwise. The alternation Alt(e) of an expression e is recursively 
defined as Alt(e) = 0 for every ee Su X and Alt(ei 0 62 ) = maxi^ji 2 }{Alt(ed + 
/®(ei)} + 1 for ® € {©,©} and any expressions 61,62 £ Exprs(A’). We add 1 to 
distinguish expressions not using any binary operators from the expressions using 
exactly one type of binary operator. We denote by Expr^(A’) all expressions with 
alternation bounded by N. 

Theorem 1 and Corollary 1 show that Copyless CRA are not closed under 
reverse, that is, the execution of Copyless CRA is asymmetric with respect of 
an input. A close look to the copy less CRA B define in Section 5 shows that the 
unbounded alternation of the final expression is the reason why the function is 
not closed under reverse. This fact motivates the definition of bounded alternating 
copyless CRA (BAC-CRA), a subclass of copyless CRA. It turns out that the 
class BAC-CRA have good properties for a computational model. In particular, 
BAC-CRA is closed under regular-lookahead (see the definition below) which is 
the main result of this section (see Theorem 2). 

Formally, we say that a Copyless CRA A has bounded alternation if there 
exists A^ e N such that, for every w = oi... a„ e S*, if (go, 1 ^ 0 ) (?», t'n) 

is the run of A over w and 5{qi-i,ai) = for every 1 <i <n, then it holds 





that VQoaio.. .oan° ^ Expr^ (‘^) i that is, the number of alternations of all 

ground expressions that are outputs of A is uniformly bounded by a constant. 
A CRA A is called a bounded alternating copyless CRA (BAC-CRA) if A is 
copyless and has bounded alternation. Most of the Copyless CRA examples 
presented in this paper have bounded alternation. For instance, Examples 1 
and 2 are part of the class of BAC-CRA. 

Interestingly, BAC-CRA are closed under regular lookahead A CRA with reg¬ 
ular look-ahead (CRA-RLA) [2] is an extension of CRA where the machine 
can take decisions by reading the remaining suffix of the string in a regular 
manner. Formaly, let REGi; be the set of all regular languages over S. A 
CRA-RLA is a tuple A = (Q, S,X,S,qoTi'o,h') where Q, S, A, qo, vq, and 
fi are defined as before and 6 : Q x REGi; ^ Q x Subs(A) is a partial tran¬ 
sition function with finite domain restricted as follows: for a fixed state q let 
S{q,Li) = {qi,ai), 6 {q,L 2 ) = {q 2 ,<J 2 ),. ■ ■ ,S{q,Lk) = {qk,(^k) be all transitions 
with q in the first coordinate and Li,..., e REG^;. Then the languages 
Li,..., Lk are pairwise disjoint (i.e. Li n = 0). Note that the last require¬ 
ment forces that on a given word the automaton A is deterministic. Given a 
string w = ai... On ^ S*, the run of A over w is a sequence of configurations: 

(<?OAo) ^ {qi,Vi) ... ^ {Ln,Vn) 

such that for 1 < i < n, S{qi-i,Li) = (qi, Ui), w[i, •] € Li and i'i{x) = \vi-i o cri(a:)] 
for each x e X. The output of A over w is defined as usual, i.e. [^](r(;) = 
|i4,(/r(g„))]. The extension of the classes of copyless and bounded alternation 
GRA-RLA are defined similarly. 

The main result of this section is to show that BAC-CRA with regular looka¬ 
head defined the same class of functions as normal BAC-CRA. 

Theorem 2. For every BAC-CRA A with regular look-ahead there exists a 
BAC-CRA A' without regular look-ahead that computes the same function, that 
is, [A](iy) = [A'](w) for every w ^ E*. 

The proof of Theorem 2 goes in two steps: (1) we show that for every BAC- 
CRA A with regular look-ahead there exists an unambiguous BAC-CRA A' 
equivalent to A (Lemma 7) and (2) for every unambiguous BAC-CRA A' there 
exists a BAC-CRA A!' equivalent to A' (Lemma 8). Clearly, if we show (1) and 
(2), then Theorem 2 will be proved. 

An unambiguous CRA is a non-deterministic CRA such that for every word 
w there exists exactly one run that is accepting. Formally, a non-deterministic 
CRA is a tuple A = {Q,E,X,A,qo,i'Q,F,fj,) where Q, E, X, qo, vq, and fi are 
defined as before, A<^QxExQx Subs( A) is a finite transition relation and F is 
the set of final states. Additionally, we assume that for every q,q' e Q and a e E 
there exists at most one a e Subs(A) such that {q,a,q',a) e A. Given a string 
w = ai... On ^ E*, a, run of A over w is a sequence of configurations: 


(go,J^o) {qi,vi) ^ 


{qn,Vn) 


such that for 1 < j < n, {qi-i,ai,qi,ai) 6 A and r'i(a;) = \vi-i o ai{x)J for each 
X € X. Furthermore, a run of A over w (like the one above) is called accepting 
if qn 6 F. Then, we say that a non-deterministic A is unambiguous if for every 
w i E* there exists exactly one accepting run of A over w. The output of an 
unambiguous CRA A over w is defined as [A](w) = where {qmVn) 

is the final configuration of the only accepting run of A over w. The definition 
of unambiguous copyless CRA or unambiguous BAC-CRA are a straightforward 
extension of the definition above. 

The following lemma is a well-known property of unambiguous finite automata 
and, in particular, of unambiguous CRA that will be useful during the proof 
Lemma 7. 

Lemma 6. [13,16] If P o-nd p' are two runs of an unambiguous finite automaton 
A over w e E*, then the last states of p and p' are different, that is, p(|w|) + 
p'{\w\). 

The previous lemma implies that the number of active runs of an unambiguous 
automaton until any position of the input is always bounded by the number of 
states. This observation will be useful in the proof of the second step. First we 
show next that BAC-CRA extended with regular look-ahead can be expressed 
with unambiguous BAC-CRA. 

Lemma 7. For every BAC-CRA A with regular look-ahead there exists an un¬ 
ambiguous BAC-CRA A' that computes the same function, that is, [A](?i') = 
[A'](w) for every w e E*. 

Proof. Let A = {Q,E ,X, A,TZ,qQ,VQ, p) be a copyless CRA-RLA. To represent 
the transition function A finitely, let Li,..., be all the regular languages in 
the finite domain of A and, for each i < N, let Ai = {Pi, E,5i,p],Fi) be a finite 
state automaton such that Li = C{Ai). To unify the automata structure of all 
Ai, define the set of states P = Pi^ ^^e transition function <5 = 6 i, and 

the set of final state F = \i)]ii,Fi, that is, P, S and F are the disjoint union 
of states, transitions and final states, respectively, of the automata Ai, ... ,An- 
Then for every i < k, define the automaton TZi = {P, E,5,q],F). By the definition 
oi P, 6 and F, one can easily see that Li = C{TZi) for every i <k. 

We are ready to define the unambiguous BAC-CRA equivalent to A. Let A' = 
{Q', E, X, A',q'^,UQ,F', p') be a BAC-CRA such that: 

— Q' = <5 X 2 ^ is the set of states, 

— Z\' £ Q' X A X Q' X Subs(A’) where {{q,S),a,{q',S'),a) e A' iff there exist 
{q,Li,q' ,a) € A for some i < N, and a surjective function f ■ S u [p]} S' 
such that S{s,a) = f{s) for every s € ^ u {p]}, 

— <?o = (90,0), 

— F' = {{q,S)\ScF}, and 

— p' -Q' ^ Expr(A’) where p'{q, S) = p{q) for every (< 7 , S) € Q'. 


The main idea of A' is to guess and choose, at each letter, the right transitions 
in A that is satisfied by the current suffix. In each state {q, S) of A' we keep 
the current state q of a run and a subset S' of P that includes all regular tran¬ 
sitions that has been taken so far by the run on q. Then we have a transition 
{{q,S),a,{q' e A' if there exist a transition (q, Li,q' ,(t) e A (i.e. a tran¬ 
sition from q to q') such that we can extend each state in S u {p^} to a state 
in S'. Note that, in order to start simulating the hnite automaton TZi over the 
suffix, is also included on the set of states that are updated. Finally, if the 
last state {q, S) of a run satisfies S £ F, then we know that all suffixes during 
the run satisfies the regular look-ahead of the transitions and, then, the state 
{q, S) is final. 

We show first that A' is unambiguous. By contradiction, suppose that A' is 
not unambiguous, that is, there exist lu = oi... a„ e S* and two different ac¬ 
cepting runs p and p' of A' over w. Let i < n be the least position such that 
p{i) = p'(i) = {q,S) but p{i + f) = {qi,Si) + ((? 2 ,-S' 2 ) = p'{i + !)• We know 
that this position exists since, by construction, it holds that p(0) = p'(0). Let 
{q,L,qi,ai) 6 A and {q,L',q 2 ,a 2 ) € Z\ be the transitions that witness the tran¬ 
sitions {{q,S),a^+l,{ql,Sl),al) e A! and ((g,S'),aj+i, (g 2 ,S' 2 ),o- 2 ) e A!. Since 
both runs are accepting, then it is straightforward to show by induction that 
w[i,-] € h and w[z,-] e L' for the suffix w[i,-] of w at position i. Then we have 
a contradiction since, by definition of CRA-RLA, we know that L n L' = 0. We 
conclude that A! must be unambiguous. Note that during the construction we 
did not change the assignments cr in the transitions. For this reason, we can also 
conclude that A! is a copyless CRA with bounded alternation. 

For the last part of the proof, we have to show that [A](iu) = [A'](r(;) for every 
w € A*. It is easy to verify that for every run (qo, vq) (g„, r'„) of A over 

w e S*, there exist sets Si such that the sequence ((< 70 , «5'o), Sn),i^n 

is an accepting run of A' over w. For a given word w the sets Si are uniquely 
determined by the transitions A and 5. Thus, [AKiu) = \_Vn{p{qn))\ = [A'Kw) 
for every w e S*. □ 

Next we prove the second step of the proof. 

Lemma 8. For every unambiguous BAC-CRA A there exists an BAC-CRA A' 
that computes the same function, that is, [A](iu) = [A'](ru) for every w € S*. 

Proof. Before going into the main details, we need to introduce standard nota¬ 
tion for trees and parse trees that will be useful during this proof. 

Trees. Let A be a set of labels. An (unordered) labeled A-tree t is a finite 
function t : nodes(t) ^ S such that nodes(t) is a finite prefix-closed subset of 
N*, that is, w e nodes(t) whenever w-i e nodes(t) for some i e N. Additionally we 
assume that w ■ i implies w-{i-l) e nodes(<) for i > 0. We say that the e-word is 
the root of t and that w -i e nodes(t) is a child of w. Further, we write labels(t) 
to denote the set labels of a tree t. For any node w e nodes(t), we denote by 
^[■u;] the subtree rooted at w, i.e. the labeled tree t['u;] : nodes(t['u;]) ^ S such 



that t[w](i) = t{w-i) for every i e nodes(f[re]). We usually write a{ti,... ,tk} to 
denote a tree whose root is labeled by a and ti,... ,tk are the subtrees hanging 
from the root (i.e. for every i < k there exists j e N such that t[j] = ti). We 
say that w e nodes(t) is an internal node ot t ii w ■ i e nodes(f) for some i € N. 
Otherwise, w is called a leaf of t. The set of all leaves of t is denoted by leaves(f). 
We say that a tree is complete if any internal nodes has at least two children. 
One can easily check that if t is a complete tree, then | nodes(t)| < 2 • | leaves(t)|. 
Finally, we denote by Trees(T’) the set of all Z’-trees. 

Parse trees. Let labs = S' u {©, ©} u fb be the a set of labels in S and variables 
X. A parse tree is a complete labeled labs-tree p whose leaves are labeled by 
SvJ X and internal nodes by {©, ©}. We denote by Parse(A’) the set of all labs- 
parse trees. For any expression e e Expr(A’), the parse tree of e is a tree Pe 
recursively dehned as Pe = e whenever e is equal to a constant or variable, and 
Pe = ^{penPe^} whenever e = ei ® 62 where ® e {©,©}. 

Conversely, any parse tree p can be converted into an equivalent expression 
exp(p). Ifp is a single node then e= p otherwise Up = ®{pi,... ,pk} for © e {©, ©} 
then exp(p) = exp(pi). So far we used © and © as binary operations thus 

the equivalent expressions are not dehned uniquely. We allow for this because this 
dehnition is unique up to evaluation equivalence since © and © are commutative 
and associative. We dehne the size of an expression e as the number of nodes in 
the parse tree Pe- 

Example 3. Consider the expression e = ((xq Q {y Q 2)) © 3) © (z © 4) where 
x,y,z e X and 2,3,4 6 S. One can easily check that the parse tree pe of e is the 
following: 


X 


/ \ 

0 3 

/ \ 

© 

/ \ 

y 2 


0 


/ \ 

2 4 


Internal nodes of a parse tree can be merged by applying associativity and com¬ 
mutativity of © and ©, respectively. Formally, for any label ® 6 {©, ©} dehne the 
battening function hat® such that hat®(®{pi,... ,Pk}) is equal to {pi,... ,pk} 
whenever ® = ® and ®{pi,... ,pk} otherwise, for any label ® 6 {©, ©} and trees 
Pi,... ,Pk. Then, given a parse tree p we denote by p* the reduced parse tree 
constructed recursively as follows: p* = ®{hat 0 (pj),... ,hat®(p^)} whenever 
p = ®{pi, • ■ • ,Pk} and p* = p whenever p* is equal to a variable or constant. 
By the construction of p* , the label of any node of p* is different to all root 
labels of its children, i.e. p*{e) + P*{i) for any i 6 nodes(p*) n N. Or equivalently 
if ® 6 {©,©} is a label of a internal node then it does not have a child labeled 
with ®. 



Proposition 5. Let p he a parse tree. The expressions exp(p) and exp(p*) are 
equivalent and the alternation of these expressions is the same. 

Proof. We prove this by induction on the depth of p. If the depth is 0 then 
p = p* hy definition. Let p = ... ,pk} and p* = ( 8 >{flat 0 (pi),... ,flat 0 (p^)}. 

Let p' = ... ,p^}. By the induction assumption we know that exp(p) and 

exp(p') are equivalent and the alternation of these expressions is the same. To 
prove the proposition we need to show that this also holds between exp(p') and 
exp(p*). Let p* = ®{pi^i,... ,Pi,ki}- It suffices to prove that for every i the parse 
trees p' and p*’* = ®{pi,..., flat 0 (p*),... ,p^} define equivalent expressions with 
the same alternation. If ® # ® then these parse trees are the same. Otherwise 
p*’* = ®{p^,... ,pi,i,... ,Pi,ki,- ■ ■ ,Pfc}- Then by definition 

exp(p') = 0 exp(p*) « 0 exp(p*) ® exp(p*) = 0 exp(p*) ® 0 exp(p,,j) p*’* 

3 3*i 3*i 3 

where is the relation of equivalent expressions. These equalities follow from 
the commutativity and associativity of ®. 

We show that p' and p*’* have the same alternation. Formally the definition 
of alternation was for binary operators © and 0 . It is easy to generalize it for 
the ( 8 > notation. We define /®(e) = 1 if e = QjCj, /©(e) = 1 if e = ©j Cj and 
/(e) = 0 otherwise. The alternation Alt is defined as Alt(c) = 0 for every c € S' 
and Alt(( 8 )j ej) = maxj{Alt(ej) + /®(ej)} + 1 for ® e {©, ©} and any expressions 
Cj over S. 

Alt(p') = max{Alt(ea:p(p*)) + /®(exp(p*))} + 1 = 

3 

max{max{Alt(ea;p(p*)) + / 0 (exp(pj))}, Alt(ea:p(p*)) + /®(exp(p*))} + 1 

3*^ 

Since / 0 (exp(p*)) = 0 then this is equivalent to 

max{max{Alt(ea;p(p*)) + /0(exp(p*))}, Alt(ea;p(p*))} + 1 = 

3*i 

max{ max{ Alt (exp(p )))+/0 (exp ())}, max{ Alt (pi, ^)+/0 (pi j)} }+1 = Alt (p* ’ *) 

j 

□ 

By Proposition 5 the alternation of an expression e is equivalent to the depth of 
its reduced parse tree p*. From now, we will implicitly assume that parse trees 
are always in their reduced form (i.e. p = p* for every p € Parse(A’ u 3^)). 

Example f. Recall the parse tree pe of the expression e. The following tree is the 
reduced parse tree p*: 




0 


/ \ 

« 4 


X 


2 


One of the main ingredients of the proof are trees labeled by substitutions. 
More precisely, for two disjoint set of variables X\ and X-i, define Subs(ffi, ft 2 ) 
to be the set of all copyless substitutions a '■ X\^ Expr(ffiuff 2 ), that is, copyless 
substitutions where the domain contains only variables in X\. Here, composition 
between substitutions in Subs(ffi, X 2 ) is defined in a straightforward way where 
ff 2 -variables are treated as constants. Notice that the composition of two sub¬ 
stitutions is copyless for the registers Xi but not necessarily for the registers X 2 . 
This is because the registers X 2 are not in the domain of these substitutions. 

The set Subs(ffi, ff 2 ) is used to label trees. Formally, we denote by Trees(d:’i, ft 2 ) 
the set of all trees labeled by Subs(ffi, ^ 2 ). We usually called a tree t e Trees(d:’i, ft 2 ) 
a substitution tree. Furthermore, we say that t is copyless if t(u) is a copyless 
substitution for every u € nodes(t) and : 

Var(t(M)) n Var(t(w)) £ ( 17 ) 

for every different u,v ^ nodes(t) where Var(t(u)) and Var(t(?;)) are the set of 
all variables in t{u) and t(v), respectively. In other words, in addition that each 
substitution is copy less, each ft 2 -variable is used at most once in a substitution 
tree t (note that there is no restriction in Xi). Notice that with this restriction 
the composition of substitutions that are labels of different nodes is copyless also 
for the registers X 2 . 

We say that t is constants-free if, for every u e nodes(t) the substitution t(u) 
does not use constants from the semiring. Finally, for any node u € nodes(t) we 
define the collapse operation t\u) to be the expression: 

t\u) = t(e) o 1]) o ... o /c]) 

where k = |u| and is the prefix of u until position i. In other words, t\u) 

is the composition of all substitution along the branch from the root until u. 
Notice that this composition is a copyless substitution because of the condition 
(17). 

Let A = (Q, S, X, (5, go, p) be an unambiguous BAC-CRA whose alterna¬ 
tion is bounded by N. To construct a deterministic BAC-CRA A' from A, the 
idea is to keep in memory a tree of runs that encode how runs are branching 
when the word is read. Of course, if we keep the branching of all runs in memory, 
the tree would be unbounded. A characteristic property of unambiguous CRA is 
that, for each prefix, there is at most \Q\ active runs and, furthermore, the last 
state of all of them are different (Lemma 6 ). These two facts suggest a tree struc¬ 
ture of the runs where each branch is a run and where the number of branches is 
bounded by |Q|. Although we can keep in finite memory the branching structure 
of the actives runs of A, we cannot do the same trick over the registers A and 
naively keep copies of registers for each run (recall that A! must be copyless). 

To overcome this problem. A! will postpone the substitutions of each run by 
labeling internal tree structure of runs with substitutions. Clearly, A! cannot 
postpone these substitutions forever and store a long sequence of these objects 
with finite memory. The key idea here is to prune and reduce the tree structure 


by doing partial evaluation of the substitutions whenever is possible. We will 
show that by exploiting the bounded alternation of the output and the copyless 
restriction of A, we need only finite amount of memory to remember the tree 
structure and the substitutions of all runs. 

Recall that X is the set of registers in A. Let X he a. disjoint copy of X. 

Formally, we define X = {x \ x e X} to have a natural correspondence be¬ 

tween the registers in X and X. We construct a deterministic BAC-CRA A' = 
as follows. 

— Q’ is the set of all pairs {t, B) where t € Trees(A’, jV) is a complete, copyless, 
and constants-free substitution tree and B : leaves(t) Q is an injective 
function. Additionally we assume that \range{B) n FI = 1 . In the end we 
restrict Q' to its subset of reachable states Q'^. 

— 3^ is a set of registers of size |3^| = 2 • \Q\ ■ \X\ ■ {\X\ • TV + 1) satisfying X cy 

and X ny = 0 (recall that A is a copy of A). 

— Qq = (to,Ro) is the initial state of A' where to is a single-node tree labeled 
with CTo € Subs(A,3T) and i3o(e) = go such that <jo{x) = x for every x e A, 
where x is the copy of x in A. 

— Aq .y ^ S is the initial substitution defined as t'o(x) = vo{x) for every x € A 
and r'o(y) = 0 for every y ey \ X. 

— y' is the hnal substitution defined for every (t, B) e Q' by: 

y.'{t,B) = A{u)oy{q) 

where u € leaves(t) is the only leaf satisfying B{u) € F and q = B{u). 

— S' is the transition function that will be conveniently dehned in the sequel. 

We start explaining the relation between the set of registers A, A and y. 
The A-variables in this setting represent A-variables from A inside the internal 
structure of A! (i.e. states). In fact, A-variables will never be used as real vari¬ 
ables by A!■ They will just be used to keep track of temporary substitutions. 
Regarding y, the decision of taking the size of y equal to 2-\Q\- |A| • (|A| • TV + 1) 
is technical and will be clear later in the proof. The property X £ y is needed 
for the definition of the initial substitution. Besides that we will not use the fact 
that A £ 3 ;. 

Each state in Q' is composed by a complete, copyless, and constants free sub¬ 
stitution tree t and an injective function B : leaves(t) -> Q. As was suggested 
before, t keeps track of the branching history of the active runs of A and B 
labels leaves of t with states in Q. The plan here is that each active run of A is 
represented by a different branch of t and the state assigned by B to the leaf of 
the branch represents the current state of the run. 

Recall that Q'^ £ Q' is the set of reachable states in A!. For a given word 
w consider the set of active runs in A. By Lemma 6 every active run on w is 


determined by its last state. We define Q{w) £ Q as the set of last states in 
the active runs in A. To understand the main purpose of we state the 

following Lemma. Its proof is postponed to the end of this section. 

Lemma 9. Consider one of the runs of A on w and let he its configura¬ 
tion. Then there is a run of A' on w reaching configuration ((t, , where f 

is a valuation over y. Moreover Q{w) = range{B) and for all q € Q{w) there 
exists u € leaves(t) such that B{u) = q and 

u = t\u). 

In other words, we can recover the configuration (g, v) from a leaf u by applying 
B over u and by composing the substitutions of the branch from the root of t 
into u. 

We show that Q' is a finite set. We bound the size of any state {t,B) 6 Q' in 
terms of the values |Q|, |df|, and N. First, the number of nodes in t is bounded by 
2 • IQI, that is, |nodes(t)| < 2 • |leaves(t)| < 2 • IQ] since t is a complete tree and B 
is an injective function over leaves(t). Second, each copyless and constants-free 
expression that one can write with \X ujy| variables is of size at most 2-(|df| + |3^|). 
Recall that the size of an expression is defined as the number of nodes in its parse 
tree and parse trees are complete trees. This means that the size of any copy less 
and constants-free substitutions a e Subs(<T,3^) satisfies \a\ < 2 ■ \X\ ■ {\X\ + |3^|) 
and thus the number of possible labels for t is finite. Finally, B is an injective 
function from leaves(t) into Q. Given that t is of bounded size, then B is also 
bounded. 

Before we define the transition function S' we need to give a battery of defi¬ 
nitions. A partial substitution a '■ y ^ Expr(3^) is a substitution whose domain 
is a subset of y. For an expression e we use the notation Var;y(e) = Var(e) n y, 
i.e., the set of variables from y. The key idea in the construction of S' is the 
notion of an A-reduction. An A-reduction of p € Parse(A u 3^) is a tuple (r, cr) 
where r € Parse(A u jP) is a parse tree without constants and a :y ^ Expr(jy) is 
a partial substitution such that (T(exp(r)) is equivalent to exp(p). Moreover the 
domain of a is equal to Varj;(exp(r)) and exp(r) is a copyless expression. The 
last requirement ensures that each variable from 3^ occurs in r at most once. 

The goal of an A-reduction is to factorize constants into 3^-variables in such 
a way that we do not loose the previous tree. Of course, an A-reduction could 
increment the number of 3^-variables introduced to remove constants. The trick 
here is to construct, for each parse tree p, an A-reduction (r, cr) such that the 
number of nodes of r depends only on \N\ and |A|. We define this A-reduction 
by induction over the depth of the parse tree by using a function red;r(') that 
receives a parse tree p and produces a tuple (r,cr). For this definition we assume 
that 3^ is possibly infinite. Later, in Lemma 10, we show that we need only a 
finite set of labels, which proves that this definition is correct. 

For the base case, if p = x € A then we define red^rCp) = (^>^ 0 ) where <70 is 
the empty function. Otherwise if p is a constant or a variable from 3^ then we 


choose a variable y ^ y and define ledxip) = {y,a), where a is defined only on 
y by cT{y) =p. 

For the induction step we need to be able to relabel rV-reductions. Let cr be a 
partial substitution whose domain is A £ 3^ and let (r, a) be an A’-reduction of p. 
We say that (r', a') is a relabeling of (r, cr) if the domain of cr' is A' and there is a 
bijection f ■■ A' ->■ A such that a'{x) = and f{r') = r, that is substituting 

variables from A' into variables from A changes r' to r. By definition a relabeling 
is changing the domain of a but the codomain of cr is the same. Moreover (r', cr') 
is also an A’-reduction of p. 

Suppose that p is of the form p = ®{pi,... where each pi € 

Trees (dl contains at least one variable in X and each p' € Trees (3^) contains 

no variables in X. Furthermore, suppose that redxipi) = {'ri,cri ),..., iedx{pj) = 
{rj,aj) are already defined. We relabel the rT-reductions (pi,cri) to {Pi,<7i) in 
such a way that the domains of cr' are pairwise disjoint. Notice that we need 
to have enough labels in 3^ to do such a relabeling. In Lemma 10 we show that 
there are enough labels in 3^. We define redxip) - (^w) recursively as follows: 

r = ®{r[,...,r'j,y} (18) 

cr = (cr( U ... U ct' )[2/ ^ exp ( ® {p[,. . . ,Pfc})] 

where y € 3^ is a fresh variable not used in cti ,..., tTj and (cri u...ucrj ) corresponds 
to the disjoint union of the substitutions. Recall that ai have disjoint domains. 
If fc = 0 (there are no subtrees without label from X) then we do not add the 
additional variable y. 

In this construction the subtrees ,..., that do not use X registers are 
replaced by a new fresh variable y and the content ®{pi,... ,p(.} is assigned into 
y. It is clear from the definition that (r, cr) is an R’-reduction for p. Since ct' have 
disjoint domains and y was chosen as a fresh variable then exp(r) is copyless 
and the domain of cr is equal to Xi Vary(exp(r')) u {y} = Vary(r). 

Example 5. Recall the reduced parse tree p* of the expression e in Example 4. 
Suppose that x € X and y,z ^y. Then the rT-reduction redA'(Pe) = (^w) of pI 
is equal to: 


r: ^ cr: u ■■= y Q 2 

\ c;:=3ffi(z04) 

/ \ 

X u 

where u and v are fresh variables in 3^. One can easily check that exp(p*) = 
(7(exp(r)) and, thus, the expression defined by p* is preserved in (r, cr). 

We say that a partial substitution is copyless if it is copyless with respect to 
the variables in its domain. We prove that the size of r depends on the size of 
R” and N. 


Lemma 10. Suppose that p is a parse tree of a copyless expression in Expr( A” u 
y) whose alternation is bounded by N. Ifredxip) ~ then: r is a parse tree 

of a copyless expression with alternation bounded by N; a is a partial copyless 
substitution; and the number of y-variables in r is bounded by \X\ ■ N + 1. 

Proof. Suppose that redA:'(p) = {r,a). By the definition of redAf(-) it is easy to 
check that r is copy less and its alternation is bounded by In fact, each time 
that a subtree is replaced in the original tree p, we use a new fresh variable, 
that is, each variable in r is used at most one time. Notice that in our recursive 
definition of red^:- there was at most one recursive step for every internal node. 
In the definition of partial substitutions we assign subtrees of internal nodes to 
a new fresh variable from y. The recursion is not called in the subtrees assigned 
to the new fresh variable thus for different variables x the expressions a(x) 
correspond to different fragments of the tree p. Since p is copyless this proves 
that a is also copyless. 

The most interesting part is the bound of 3^-variables in r. We define Xr = 
Var(exp(r))nT’, i.e., the labels from X used in r. Similarly let W = Var(exp(r))n 
y, i.e., the ^variables used in r. Since exp(r) is copyless then every label from 
Xr and W occurs only once in r. We prove a slightly stronger variant of the 
inequality in the lemma. We show that |W| ^ |<W| • iV + 1. Since \Xr\ < \X\ this 
proves the lemma. 

The proof is by induction over the alternation of p. For the base case we have 
|nodes(r)| = 1 thus the bound is trivially true. For the inductive step suppose 
that the result holds for trees of alternation at most N. Take a parse tree p with 
alternation + 1 of the form: 

where each pi e Parse(T’ u 3^) contains at least one variable in X and each 
p' € Parse(3^) contains no variables in X. Since we assumed that parse trees are 
in their reduced form the alternation of eachp^ is at most N. Using the induction 
assumption on the parse trees pi we get \yp^ \ < \Xp^ \ • A^ + 1 for every i. 

Since the labels in Xp^ occur also in Xr we get the following 

t\yw\ ^ ^ \Xr\-N + J. 

i=l i=l 

The set W is divided into the set of all nodes in and possibly the additional 
node y added for the remaining expressions p[. Notice that the internal nodes 
are labeled with © and © thus the root of r does not contribute to the sum. We 
get the following: 

|3^r| < E|3 ^pJ + 1 < \Xr\-N + j + l 

i=l 

Notice that since in every parse tree pi there is at least one node with a label 
from X. This gives us j < |TV|. Using this bound we get 



which proves the induction step. 


□ 


We can naturally extend the definition of reduction red;r to be defined on ex¬ 
pressions e € Expr(d:’u3^). The result is defined as (e', a), where e' = exp(redA^(_Pe)) 
Furthermore, we extend the function redA:’(-) from expressions Expr(df u 3^) 
to substitutions in Subs(rT,3^). More precisely, for any a e Subs(T’,3^) define 
redx{a) = {a',aa) such that redAf(a(x)) = {a'{x),ax) for every x e X where 
a a is the disjoint union of all assignments of the form ax- If the domains of ax 
are not disjoint then we relabel the reductions {a'{x),ax) in such a way that 
the domains are disjoint. For simplicity we assume that (a'{x).,ax) are already 
relabeled. The domain of a^ is the sum of all domains of ax- Finally, we also ex¬ 
tend redAf(-) from substitutions to substitution trees such that redAr(t) = (t'^at) 
where nodes(t) = nodes(t'), redAf(t(M)) = (t'(u),cr„) for each u e nodes(t), and 
at is the disjoint union of all substitutions a^- Similarly if the domains of the 
substitution were not disjoint then we can relabel the reductions. The domain 
of at is the sum of all domains of tT„. We extend the partial substitution at to a 
substitution by 




at(x) if X is in domain of a 
0 otherwise. 


Notice that at{t') = a[(t') since the set of ^variables in t' is the domain of cr*. 
We assume that the reduction red^r on substitution trees returns an extended 
substitution defined on y. The following lemma, regarding red^rCO over substi¬ 
tution trees, will be useful later in the correctness proof of 5'- 

Lemma 11. For any copyless substitution tree t e Trees (A", 3^), if ledxit) = 
{t',at), then t' is a copyless and constants-free substitution tree and at is a 
copyless partial substitution. Furthermore, the number of y-variables in t' is 
bounded by |nodes(t)| ■ \X\ ■ {\X\ ■ N + 1). 


Proof- We start from showing that at is copy less. Suppose contrary that there 
exist two registers x,y such that Var(cri(a;)) n Var(cri(y)) + 0. We can assume 
that both registers x and y are used in t' because otherwise at{x) = © or at{y) = © 
and Var(©) = 0, which is a contradiction. By definition at(x) = a^^x) = az(x) 
and at(y) = au'(y) = cTz'iy) for some u,u' et and z,z' € X, where az{x),az'{y) 
are disjoint fragments of expressions t{u){x) and t{u'){y)- If u u' then this 
violates the condition (17). If u = u' and z + z' then it violates the condition 
that t{u) is a copyless substitution. Otherwise u = u' and z = z' then x and y are 
from the same parse tree. It violates Lemma 10 which proves that az is copyless. 

Let redA:’(p) = (r, tr) be a reduction on parse trees. By definition the set of 
3^-variables in r is the domain of a and each 3^-variable occurs only once in r. 
The reduction ledx on t uses fresh variables for every parse tree. Thus the set of 
3^-variable used in t' is the domain of at and each variable occurs at most once. 
This proves that t' is copyless. Since each parse tree in t' is without constants 
then the tree t' is also constant-free. 


Regarding the number of fresh 3^-variables needed to construct t', we know by 
Lemma 10 that red^(-) applied over an expression with alternation bounded by 
N needs |A”! • iV + 1 new fresh variables. Then red^r (•) applied over a substitution 
in Subs(rT,3^) needs at most \X\ ■ {\X\ • TV + 1) fresh 3^-variables and over a 
substitution tree t at most | nodes (t)| •|TL|-(|TL|-iV + l) fresh TV-variables. □ 

Comming back to the definition of the transition function 5' of A' ^ recall that 
states of A' are of the form {t, B) where t is a complete, copyless, and constants- 
free substitution tree and B : leaves(t) ^ Q is an injective function. For each 
transition of the form S'{t,B) = we show how to convert t into t' 

and how to update B into B' by a composing four different processes: extend, 
prune, shrink, and reduce. In the sequel, we explain each procedure in detail. 

(Extend). The first step is to extend branches in {t,B) to the next states 
when reading a letter a e E*. We define this process formally by the function 
extend((t,S),a) = {ti,Bi) that receives a state {t,B) e Q' and a letter a e E* 
and outputs a pair (ti, Bi) where ti is a substitution tree and Bi : leaves(ti) ^ Q 
is a partial injective function. The substitution tree ti is defined as an extension 
of t (i.e. nodes(t) £ nodes(ti)) such that ti{u) = t{u) whenever u e nodes(t) and 
for every v e leaves(t), if there exists a transition (B(v)^a,q^a) e 6 , then there 
exists i e N such that ti{v ■ i) = a. The function Bi is defined only on the new 
leaves by Bi{v-i) = q. Intuitively, extend((t,R), a) = (ti,Ri) extends t whenever 
the state on a leaf of t can evolve to a new state by reading a. Notice that 5 is 
not deterministic and a leaf v e leaves(t) could be extended with more than one 
nodes. We assume that the trees are unordered thus extend is a deterministic 
procedure. 

(Prune). The problem of the function extend((t, B),a) = (ti, Ri) is that there 
could exist leaves in ti that are not marked by the function Ri and therefore 
(ti,Ri) i Q'. This happens when for a state B{v) from a leaf v there is no 
transition (B(v)^a,q^a) for any q and cr, and this branch becomes a dead run. 
This is the purpose of the function prune(ti,Ri) = (t 2 ,R 2 ): to prune branches 
that are dead and to update Ri into a total function R 2 . Formally, ^2 is an 
induced substitution tree of t 2 (i.e. nodes(t 2 ) £ nodes(ti) and t 2 {u) = ti{u) for 
every u e nodes(t 2 )) such that u e nodes(t 2 ) iff r: • u € dom(Ri) for some u e N*. 
In other words, we keep only nodes that are ancestors of leaves that are marked 
by Ri and the remaining nodes are pruned. Finally, we define R 2 : leaves(t 2 ) ^ Q 
by B 2 = Ri. This dehnition is correct because we did not remove any node from 
the domain of Ri. Moreover the paths from the root to leaves in t 2 did not 
change and in particular for every u e leaves(t 2 ) we have t^iu) = 

(Shrink). By adding and removing branches with the procedures extend and 
prune it could happen that t 2 is not a complete tree and (^ 2 , R 2 ) i Q' (i.e. ^2 could 
contain internal nodes with just one child). The problem with these nodes is that 
they are redundant despite that they can be easily removed from ti by shrinking 
into one node. For this purpose, we define the procedure shrink(t 2 , R 2 ) = (^ 3 , R 3 ) 
recursively. We show by induction on the depth of t 2 the following properties: is 
a complete tree; R 3 is a injective function from leaves(t 3 ) into |Q|; rangei^B^) = 


range{B 2 ) and that for every u e leaves(t 2 ) there is u' e leaves(t 3 ) such that 
i? 2 (u) = B 3 (u') and t^iu) = 4(^0- particular this means that the size of ts is 
bounded by 2 • |(5|. 

For trees that have only one node we define shrink as the identity function 
and the properties are kept trivially. Suppose ^2 = cr{ri,...,r„} for some a e 
Subs(fb,3^). For every j € {1,... ,n} let ij € N be the node in t 2 corresponding 
to the root of Then for every leaf u € leaves(rj) the node ij • u is a leaf 
in < 2 - Moreover leaves(f 2 ) = • u | u € leaves(rj)}. We define the functions 

Cj : leaves(rj) ^ Q by Cj{u) = B2(ij-u). Notice that range{B2) = Uj range(Cj). 

Ifu > 1 then shrink(t 2 , B 2 ) = ... ,r'^},B 3 ), whereshrink(rj, Q) = {r'j,C'j) 

and B'i{ij ■ u) = C'j{u) for every j and u € Cj. By induction the properties are 
kept in shrink(rj,for all j. Then it is easy to see that they are also kept 
for shrink(t 2 ,B 2 ). The final case is when n = 1, for simplicity we skip the in¬ 
dexes and write t 2 = cr{r} and C : leaves(r) Q. Let r' be a tree such that 
nodes(r') = nodes(r), r'{u) = r{u) for every node u + e and r'(e) = a o r(e). 
Then we define shrink(t 2 ,B 2 ) = shrink(r', C). That is, the edge between the 
root of a{r} and its unique child r is shrunk. By induction the properties are 
kept in the step from {r',C) to (t^jB^) = shrink((r', C)). Thus we only have 
to prove that range{B^) = range{B 2 ) and that for every u e leaves(f 2 ) there 
is u' 6 leaves(t 3 ) such that t^iu) = 4(u'). The first property follows from the 
fact that range{B 2 ) = range{C) = range{B^'). To prove the second property let 
u € leaves(t 2 )- By definition there exists an i such that u = i-v and B 2 {i-v) = C{v). 
Let |u| = k then: 

4(u) = t2(e) o t2(u[-,l]) o t 2 (u[-, 2 ]) o ... o t 2 iu[-,k]) 

= a o t 2 (u[-, 1]) o t 2 {u[j 2 ]) o ... o t 2 {u[jk]) 
r'{e) 

= r'\v). 

By the induction assumption there is u' e leaves(t 3 ) such that r'^u) = 4(^0- 

(Reduce). The pair is almost ready after shrinking single-child inter¬ 

nal nodes, except that substitutions inside ^3 could have constants (e.g. extend(-) 
could have introduced constants). To solve this issue, we apply the fb-reduction 
procedure red.;r(') to reduce all substitutions in Formally, we dehne the pro¬ 
cedure reduce(t 3 ,B 3 ) = where redA:(t 3 ) = (t 4 ,cr) and B 3 = B 4 . 

The application of red^: over does not change its tree structure, in partic¬ 
ular nodes(t 3 ) = nodes(< 4 ), but the labels of the tree that are substitutions. 
Thus |nodes(t 4 )| = |nodes(t 3 )| < 2-\Q\. By Lemma 11, this implies that the 
number of fresh ^variables needed to apply the procedure redA:’(-) is at most 
I nodes(t 3 )| • |fb| • (Ifbl • iV -t-1) =2-\Q\- \X\ ■ (\X\ ■ N + 1) which is exactly the size of 
y. The reduction red^: often chooses fresh variables which is not deterministic. 
To determine this procedure for every given tree we set a deterministic choice 
of every fresh variable. With this change the procedure reduce is deterministic. 





With the definitions of the procedures extend, prune, shrink and reduce, we 
are ready to define formally the transition function 5' of A' . Specifically, for 
every state {t,B) € Q' and every a € E* we define: 

= reduce( shrink( prune( extend((t,-B),a) ) ) ) = ((t4,B4),cr) 

additionally we require that \range{B4) n_F| = 1 and that B4 is injective. Other¬ 
wise B),a) is undefined. All procedures: reduce, shrink, prune, extend are 
deterministic so S' is also deterministic. In the next lemma, we show that the 
definition of S' is correct. 

Lemma 12. For any {t,B) € Q' and a € A let: 

reduce( shrink( prune( extend((t,i?),a) ) ) ) = ((O,B4),cr). 

If \range{B 4 ) n = 1 and B 4 is injective then {ti^BA} € Q' and a is a copyless 
substitution over y. 

Proof. Assume that we have (ti, Bi) = extend((t,B),a), (t2,B2) = prune(ti, Bi), 
(^3) S3) = shrink(t2, B2) and (t4, B4) = reduce(<3, B3). We first check that t4 is a 
complete, copyless, and constants-free substitution tree in Trees(A’, 3 ^). We start 
from showing that O is complete. The tree t^ is a result of prune(t2), which by 
definition is a complete tree. This proves completeness because O has the same 
structure as t^. 

We show that O is copyless and constant-free. The procedure extend(-) intro¬ 
duces new variables in t only from X in new nodes. We label the new nodes with 
substitutions a from A, which by definition are substitutions from Subs(A’, 3 ^). 
Moreover it does not introduce new variables from y thus the tree is copy less. 
The procedures prune(-) and shrink(-) do not introduce new variables in ti, and 
^2- This shows that the tree t^ is copy less. By Lemma 11 we conclude that t 4 is 
copyless and constants-free. Since we assumed that |range(B4) n B| = 1 and B4 
is injective then this proves that (f4,B4) 6 Q' . Moreover, Lemma 11 also implies 
that O' is a copyless substitution. □ 

Now we have all the ingredients to prove Lemma 9 . 

Proof (of Lemma 9). The lemma is proved by induction over the size of w € A*. 
For the base case w = e by definition q'^ = (to, Bg) where to is a single-node tree 
labeled with erg £ Subs(A’, 3 ^^ and Bg(e) = qo, where crg(a:) = x for every x e X, 
where x is the copy of x in X. The initial function zzg is defined as r'o(^) ~ ^o(x) 
for every x e X and ^^{y') = 0 for every y & y \ X. In this setting 

£, o t\u)(x) = v'qO cro(a;) = iZgCi) = vo{x). 

In the initial configuration there is only one run which ends in qo and range{Bo) = 
{go}, which finishes the proof of the base case. 


For the induction step suppose that the lemma holds for a word w. Let a € E, 
we show that the lemma also holds for w ■ a. Let {q',v') be a configuration 
reached by a run of A over w ■ a. By Lemma 6 there is a unique run of A on 
w that ends in a configuration such that {q,a,q',a) € S and v' = v o a. 

By the induction assumption we have that there is a run of A' on w reaching 
conhguration such that Q{w) = range{B) and for every x e X that 

there exists u e nodes(t) such that -B(u) = q and 

V = (19) 


Let {ti,Bi) = extend((Li?),a), (^2,^2) = prune(ti,Si), (^3,-83) = shrink(t2,S2) 
and {t4,B4,T) = reduce(t3, S3). By definition of extend the set range{Bi) is the 
set of all p such that (S(i;), a,p,Up) e 6 for some Cp and v e leaves(t). Since 
Q{w) = range{B) then Q{w ■ a) = range{Bi). Clearly by definition of the pro¬ 
cedures prune,shrink,reduce we have range{Bi) = range{B2) = range{B^) = 
range{B4). By the unambiguity of A we get that |ran(/e(S4)nS| = |( 5 (rt;-a)nS| = 

1 . Similarly to prove that S4 is injective it suffices to show that Si is injective. 
Suppose contrary that Si maps two different leaves to the same state p. Then 
by definition of Si there are at least two runs of ^ on ic • a that end in q, which 
is a contradiction with Lemma 6. 

By Lemma 12 we conclude that B),a) = (^4, S4) and that the automaton 
A' is in configuration ((t4, S4), ^or) after reading w-a. Since range{B4) = Q(w-a) 
then there exists i € IN such that ui € leaves(ti), Bi{u ■ i) = q' , and ti(m) = a. If 
we compose both sides of Equation 19 with cr then we get: 


1/ o a = ^ o t^{u) o a 



Thus we get 


v' = 


We know that the procedure prune(-) and shrink(-) preserve the outputs of the 
collapse operation. Thus there exists v e nodes(t 3 ) such that B^{v) = B'{v) = q' 
and 

v' = £,ot\^{v). 

Given that (^ 4 , S 4 , r) = reduce(t 3 , S 3 ) is an T’-reduction, we know that Tot\^(^v) = 
By replacing the last equation in the above formula, we get 


A = ^ O T o 


which proves the induction step. 


□ 


Lemma 9 in particular proves that 6' is a total function when Q' is restricted to 
the reachable states Q'^.. For this reason we restrict the set of states to Q'^. With 
Q'j. as the set of states the automaton A' is a deterministic CRA. 



To conclude the proof we show that A and A! have the same output on every 
word w. Let ( 9 , 2 ^) be the configuration of the unique accepting run of on w 
and let be the configuration of A' of the run on w. By Lemma 9 there 

exists a node u such that u € leaves(t) such that B{u) = q and 

V = 

The output of A' on w is defined as ^ o ^'{t,B). By definition q € F and 
\range{B)nF\ = 1 thus q is the unique accepting state in leaves(t). By definition 
n'{t, B) = t^{u) o ii{q). Thus we get the following equalities for the output of A' 
on w. 


^ o /r'(L B)=^o t\u) o g{q) = vo g{q). 

This finishes the proof because the expression v o g,{q) is the output of A on w. 
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